Patent · US Active

Text-based fuzzy search

US8521759B2 · kind B2 · utility

3Cited by
2References
21Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 23, 2011
Grant dateAug 27, 2013
Priority date
Expiry dateMay 23, 2031

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/3347
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

An input feature vector is computed from an input text record, the input feature vector comprising one or more features, each feature including a subsequence of characters and a frequency of occurrence of the associated subsequence. A input fingerprint is generated out of the input feature vector by choosing one or more features with non-zero frequencies and alphabetizing the features chosen. One or more input indices are generated by alphabetizing features in the input fingerprint and concatenating features occurring in subsequent locations of the input fingerprint. The input text record is matched against a target text record if (1) one or more of the input indices match a target index corresponding to the target text record and (2) the corresponding input fingerprint matches a target fingerprint corresponding to the target text record. The target text record is outputted as a search result if it matches the input text record.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.