Text-based fuzzy search
US8521759B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 23, 2011 |
| Grant date | Aug 27, 2013 |
| Priority date | — |
| Expiry date | May 23, 2031 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F16/3347
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
An input feature vector is computed from an input text record, the input feature vector comprising one or more features, each feature including a subsequence of characters and a frequency of occurrence of the associated subsequence. An input fingerprint is generated out of the input feature vector by choosing one or more features with non-zero frequencies and alphabetizing the features chosen. One or more input indices are generated by alphabetizing features in the input fingerprint and concatenating features occurring in subsequent locations of the input fingerprint. The input text record is matched against a target text record if (1) one or more of the input indices match a target index corresponding to the target text record and (2) the corresponding input fingerprint matches a target fingerprint corresponding to the target text record. The target text record is outputted as a search result if it matches the input text record.
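The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and the abstract leaves several details open, so the following are assumptions made only for illustration: features are character bigrams of the lowercased text, every feature with non-zero frequency is chosen for the fingerprint, an index concatenates two adjacent fingerprint features, and two fingerprints "match" when the Jaccard similarity of their feature sets reaches a threshold. All function names and parameters (`feature_vector`, `fingerprint`, `indices`, `fingerprints_match`, `search`, `n`, `width`, `threshold`) are hypothetical.

```python
from collections import Counter


def feature_vector(text, n=2):
    """Map each character n-gram of the text to its frequency.

    Assumption: the "subsequence of characters" is a contiguous n-gram.
    """
    s = text.lower()
    return Counter(s[i:i + n] for i in range(len(s) - n + 1))


def fingerprint(vec):
    """Choose the features with non-zero frequency and alphabetize them.

    Assumption: all non-zero features are chosen.
    """
    return tuple(sorted(f for f, count in vec.items() if count > 0))


def indices(fp, width=2):
    """Concatenate features at adjacent positions of the sorted fingerprint."""
    return {"".join(fp[i:i + width]) for i in range(len(fp) - width + 1)}


def fingerprints_match(a, b, threshold=0.5):
    """Assumed matching rule: Jaccard similarity of the two feature sets."""
    sa, sb = set(a), set(b)
    if not sa or not sb:
        return False
    return len(sa & sb) / len(sa | sb) >= threshold


def search(query, targets, threshold=0.5):
    """Return targets that share an index with the query AND pass the
    fingerprint comparison, mirroring conditions (1) and (2)."""
    q_fp = fingerprint(feature_vector(query))
    q_idx = indices(q_fp)
    results = []
    for target in targets:
        t_fp = fingerprint(feature_vector(target))
        shares_index = bool(q_idx & indices(t_fp))                   # condition (1)
        if shares_index and fingerprints_match(q_fp, t_fp, threshold):  # condition (2)
            results.append(target)
    return results


print(search("jonathan smith", ["jonathon smith", "mary jones"]))
```

In this sketch the shared-index test acts as a cheap candidate filter, and the fingerprint comparison confirms the candidate. A practical system would presumably precompute target fingerprints and store their indices in a lookup table rather than recomputing them per query, but the abstract does not specify this.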
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.