Method and system for minimizing attribute naming errors in set oriented duplicate detection
US5799302A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Mar 30, 1995 |
| Grant date | Aug 25, 1998 |
| Priority date | — |
| Expiry date | Mar 30, 2015 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06Q99/00
- WIPO field: IT methods for management
- WIPO sector: Electrical engineering
Abstract
The invention is a method for detecting duplicate records on a list or in a file. A list of one or more records is entered into a data processing system, and a nickname lookup table is applied to the records to determine a common first name. Once a common name has been determined, the method matches a first record from the list against at least one other record from the list by comparing their fields according to a set of predetermined criteria. The matching sequence determines a duplicate set, which comprises at least two records with matching fields. The method then lists the matching records sequentially so that the system can create a new record by filling each empty field with the next available corresponding field from a subsequent record within the duplicate set. The newly created record is retained on the original list, and the duplicate records are placed on a second list. The list can be pre-sorted just before the matching sequence and again just before the final list is output. Additionally, the system operato…
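The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the nickname table, the choice of first and last name as the matching criteria, the dictionary record layout, and all function names (`common_first_name`, `match_key`, `merge`, `dedupe`) are assumptions made for illustration, and the optional pre-sorting step is omitted.

```python
# Hypothetical nickname lookup table; the patent's actual table is not shown in this record.
NICKNAMES = {"bob": "robert", "rob": "robert", "bill": "william", "liz": "elizabeth"}


def common_first_name(name):
    """Map a first name to an assumed common form via the nickname table."""
    key = name.strip().lower()
    return NICKNAMES.get(key, key)


def match_key(record):
    """Assumed matching criteria: same common first name and same last name."""
    first = common_first_name(record.get("first", ""))
    last = record.get("last", "").strip().lower()
    return (first, last)


def merge(duplicate_set):
    """Create one record, filling each empty field from the next record that has it."""
    merged = dict(duplicate_set[0])
    for later in duplicate_set[1:]:
        for field, value in later.items():
            if not merged.get(field):
                merged[field] = value
    return merged


def dedupe(records):
    """Return (retained list, second list holding the original duplicate records)."""
    groups = {}
    for record in records:
        groups.setdefault(match_key(record), []).append(record)

    retained, duplicates = [], []
    for group in groups.values():
        if len(group) > 1:
            retained.append(merge(group))   # merged record stays on the original list
            duplicates.extend(group)        # matched originals go to the second list
        else:
            retained.append(group[0])
    return retained, duplicates


records = [
    {"first": "Bob", "last": "Smith", "phone": "", "city": "Austin"},
    {"first": "Robert", "last": "Smith", "phone": "555-0100", "city": ""},
    {"first": "Ann", "last": "Lee", "phone": "555-0199", "city": "Boston"},
]
retained, duplicates = dedupe(records)
```

In this sketch the two Smith records form one duplicate set because "Bob" resolves to "robert"; the merged record keeps the first record's values and takes its missing phone number from the second. Grouping by an exact key is a simplification of the pairwise field comparison that the abstract describes.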
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.