Matching engine with signature generation
US8171002B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 17, 2009 |
| Grant date | May 1, 2012 |
| Priority date | — |
| Expiry date | Aug 30, 2030 |
Classification
- Technology area (CPC Y): Emerging Cross-Sectional Technologies
- CPC primary: Y10S707/99935
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A system and a method generate at least one signature associated with a document. In one embodiment, a document comprised of text is received and parsed to generate a token set. The token set includes a plurality of tokens. Each token corresponds to the text in the document that is separated by a predefined character characteristic. A score is calculated for each token in the token set based on a frequency and distribution of the text in the document. Each token is then ranked based on the calculated score. A subset of the ranked tokens is selected and a signature is generated for each occurrence of the selected tokens. The selected list of signatures is then output.
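The steps in the abstract (parse into tokens, score by frequency and distribution, rank, select a subset, emit one signature per occurrence) can be illustrated with a minimal Python sketch. The abstract does not give the scoring formula, the separator rule, or the signature function, so everything specific here is an assumption made for illustration: the function names `tokenize`, `score_tokens`, and `generate_signatures`, the parameters `top_k` and `window`, the score (occurrence count multiplied by the fraction of the document the token spans), and the signature (a SHA-1 digest of the token plus the next few tokens). It is not the patented method.

```python
import hashlib
import re
from collections import defaultdict


def tokenize(text):
    # Assumption: any non-word character acts as the separator.
    return [t.lower() for t in re.findall(r"\w+", text)]


def score_tokens(tokens):
    # Record every position at which each token occurs.
    positions = defaultdict(list)
    for i, tok in enumerate(tokens):
        positions[tok].append(i)

    n = len(tokens)
    scores = {}
    for tok, pos in positions.items():
        frequency = len(pos)
        # Assumed "distribution" measure: share of the document
        # between the first and last occurrence of the token.
        spread = (pos[-1] - pos[0] + 1) / n
        scores[tok] = frequency * spread
    return scores, positions


def generate_signatures(text, top_k=2, window=2):
    tokens = tokenize(text)
    if not tokens:
        return []
    scores, positions = score_tokens(tokens)

    # Rank by descending score; break ties alphabetically for determinism.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    selected = ranked[:top_k]

    signatures = []
    for tok in selected:
        for i in positions[tok]:
            # Assumed signature: hash of the token and the following
            # `window` tokens, one signature per occurrence.
            context = " ".join(tokens[i:i + window + 1])
            digest = hashlib.sha1(context.encode("utf-8")).hexdigest()
            signatures.append((tok, i, digest))
    return signatures


if __name__ == "__main__":
    doc = "the cat sat on the mat and the dog sat too"
    for token, position, digest in generate_signatures(doc):
        print(token, position, digest[:12])
```

With this naive score, frequent function words such as "the" rank highest; a practical system would presumably use a score that favors more distinctive tokens, but the abstract does not say how.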
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.