Patent · US Active

System and method for genetic creation of a rule set for duplicate detection

US8577814B1 · kind B1 · utility

6Cited by
2References
30Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 28, 2011
Grant dateNov 5, 2013
Priority date
Expiry dateApr 24, 2032

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N3/126
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Embodiments may generate a population of candidate rules including multiple rule conditions for detecting duplicates, each duplicate representing different sets of item description information that describe a common item. For each candidate rule of the population, embodiments may apply that rule to a reference data set including known duplicates and non-duplicates. Embodiments may assign each candidate rule a fitness score generated with a fitness function based on the performance of that candidate rule. Embodiments may, based on the fitness scores, select a subset of the population of candidate rules as parents for the new generation of candidate rules. Embodiments may perform crossover and/or mutation operations on the parent candidate rules to generate the new generation of candidate rules. Embodiments may select from the new generation of candidate rules (or from subsequent generations of candidate rules), rules for inclusion within a rule set for detecting duplicates within item description information.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.