Patent · US Active

Matching engine with signature generation

US8171002B2 · kind B2 · utility

Cited by: 0
References: 17
Claims: 12
Family size: 0

Assignee

Inventors

Key dates

Filing date: Feb 17, 2009
Grant date: May 1, 2012
Priority date
Expiry date: Aug 30, 2030

Classification

  • Technology area (CPC Y): Emerging Cross-Sectional Technologies
  • CPC primary: Y10S707/99935
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A system and a method generate at least one signature associated with a document. In one embodiment, a document comprised of text is received and parsed to generate a token set. The token set includes a plurality of tokens. Each token corresponds to text in the document that is separated by a predefined character characteristic. A score is calculated for each token in the token set based on the frequency and distribution of the text in the document. Each token is then ranked based on the calculated score. A subset of the ranked tokens is selected, and a signature is generated for each occurrence of the selected tokens. The selected list of signatures is then output.
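
The steps in the abstract (tokenize, score, rank, select, sign each occurrence) can be illustrated with a minimal Python sketch. The abstract does not give the delimiter rule, the scoring formula, or the signature function, so everything specific here is an assumption made for illustration: tokens are split on non-alphanumeric characters, the score is frequency multiplied by one plus the positional spread, and a signature is the SHA-1 hash of a fixed-size text window starting at the token occurrence. The names `tokenize`, `score_tokens`, `generate_signatures`, `top_k`, and `window` are hypothetical and do not come from the patent.

```python
import hashlib
import re
from collections import defaultdict


def tokenize(text):
    """Return (token, start_offset) pairs.

    Assumption: the "predefined character characteristic" is taken to be
    any non-alphanumeric character acting as a delimiter.
    """
    return [(m.group().lower(), m.start())
            for m in re.finditer(r"[A-Za-z0-9]+", text)]


def score_tokens(tokens, doc_length):
    """Score each distinct token from its frequency and distribution.

    Assumption: score = frequency * (1 + spread), where spread is the
    distance between the first and last occurrence as a fraction of the
    document length. The patent abstract does not state a formula.
    """
    positions = defaultdict(list)
    for tok, pos in tokens:
        positions[tok].append(pos)

    scores = {}
    for tok, pos_list in positions.items():
        frequency = len(pos_list)
        spread = (max(pos_list) - min(pos_list)) / max(doc_length, 1)
        scores[tok] = frequency * (1.0 + spread)
    return scores


def generate_signatures(text, top_k=2, window=20):
    """Rank tokens by score, keep the top_k, and sign every occurrence.

    Assumption: a signature is the SHA-1 hex digest of the lowercased
    `window` characters that start at the token occurrence.
    """
    tokens = tokenize(text)
    scores = score_tokens(tokens, len(text))
    # Highest score first; ties broken alphabetically for determinism.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    selected = set(ranked[:top_k])

    signatures = []
    for tok, pos in tokens:
        if tok in selected:
            snippet = text[pos:pos + window].lower()
            digest = hashlib.sha1(snippet.encode("utf-8")).hexdigest()
            signatures.append((tok, pos, digest))
    return signatures


if __name__ == "__main__":
    sample = "alpha beta alpha gamma alpha beta"
    for tok, pos, digest in generate_signatures(sample):
        print(tok, pos, digest)
```

In this sketch each occurrence of a selected token yields its own signature, so a document with repeated high-scoring tokens produces several signatures rather than one, which matches the abstract's "for each occurrence" wording.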

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.