Patent · US Expired

Method and system for minimizing attribute naming errors in set oriented duplicate detection

US5799302A · kind A · utility

52Cited by
12References
16Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 30, 1995
Grant dateAug 25, 1998
Priority date
Expiry dateMar 30, 2015

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06Q99/00
  • WIPO fieldIT methods for management
  • WIPO sectorElectrical engineering

Abstract

The invention is a method for detecting duplicate records on a list or in a file and comprises a number of steps. The steps include entering a list, comprised of one or more records, to a data processing system; then, applying a nickname lookup table to the records to determine a common first name. Once a common name has been determined, the method matches a first record from the list with a second record from the list by comparing the fields of the first record with the fields of at least one other record; the comparison is based on a set of pre-determined criteria. The matching sequence determines a duplicate set, wherein the duplicate set is comprised of at least two records with fields that match. The method then lists matching records sequentially so that the system can create a new record by filling each empty field with a next available corresponding field from a subsequent record within the duplicate set. The newly created record is then retained on the original list; and the duplicate records are placed on a second list. Pre-sorting of the list can occur just prior to the matching sequence as well as just prior to outputting the final list. Additionally, the system operato…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.