Patent · US Active

Enterprise data duplication identification

US8429137B2 · kind B2 · utility

2Cited by
4References
15Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 2, 2010
Grant dateApr 23, 2013
Priority date
Expiry dateApr 16, 2031

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/215
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems, methods, and computer program products are provided for identifying duplicate data. In one exemplary embodiment, there is provided a method for identifying duplicate data. The method may include identifying one or more reference fields that include one or more data values. The method may include retrieving the one or more reference fields and one or more data values. The method may also include transforming the one or more reference fields into one or more reference fingerprint patterns. The method may also include identifying one or more target fields that include one or more target field values. The method may also include retrieving the one or more target fields. The method may also include transforming the one or more target field values into one or more target fingerprint patterns. The method may also include comparing the one or more reference fingerprint patterns with the one or more target fingerprint patterns. The method may further include determining an overlap between the one or more reference fingerprint patterns and the one or more target fingerprint patterns.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.