Patent · US Active

Matching engine with signature generation

US7516130B2 · kind B2 · utility

16Cited by
13References
10Claims
0Family size

Assignee

Inventors

Key dates

Filing dateFeb 24, 2006
Grant dateApr 7, 2009
Priority date
Expiry dateNov 20, 2026

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99935
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system and a method generates at least one signature associated with document. In one embodiment, a document comprised of text is received and parsed to generate a token set. The token set includes a plurality of tokens. Each token corresponds to the text in the document that is separated by a predefined character characteristic. A score is calculated for each token in the token set based on a frequency and distribution of the text in the document. Each token is then ranked based on the calculated score. A subset of the ranked tokes is selected and a signature is generated for each occurrence of the selected tokens. The selected list of signatures is then output.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.