Patent · US Expired

Method and apparatus for calculating similarity among documents

US7440938B2 · kind B2 · utility

9Cited by
5References
6Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 5, 2004
Grant dateOct 21, 2008
Priority date
Expiry dateMar 21, 2025

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99935
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Information that individual elements (characteristic character strings) indicative of characteristics of a registered document appear in the registered document is stored in advance. When calculating similarity of the registered document, a query designated by a searcher is analyzed. The query is represented by a characteristic vector having the individual elements which take the relation between a plurality of words into consideration. Pieces of appearance information of the individual words contained in the query are counted. The counted appearance information is compared with a searching index to calculate similarity between documents.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.