Method and apparatus for calculating similarity among documents
US7440938B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 5, 2004 |
| Grant date | Oct 21, 2008 |
| Priority date | — |
| Expiry date | Mar 21, 2025 |
Classification
- Technology area (CPC Y): Emerging Cross-Sectional Technologies
- CPC primary: Y10S707/99935
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Appearance information, recording that individual elements (characteristic character strings) indicative of a registered document's characteristics appear in that document, is stored in advance. To calculate similarity to the registered document, a query designated by a searcher is analyzed and represented as a characteristic vector whose individual elements take the relations among a plurality of words into account. The appearance information for the individual words contained in the query is counted, and the counted information is compared with a search index to calculate the similarity between documents.
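The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the choice of elements (single words plus adjacent word pairs, as one simple way to reflect relations between words), the use of cosine similarity, and every function name (`elements`, `build_index`, `doc_norms`, `similarity`) are assumptions made purely for illustration.

```python
import math
from collections import Counter


def elements(text):
    """Split text into illustrative elements: single words plus adjacent
    word pairs. This element choice is an assumption, not the patent's."""
    words = text.lower().split()
    pairs = [f"{a} {b}" for a, b in zip(words, words[1:])]
    return words + pairs


def build_index(docs):
    """Store, ahead of any query, how often each element appears in each
    registered document: element -> {doc_id: count}."""
    index = {}
    for doc_id, text in docs.items():
        for elem, count in Counter(elements(text)).items():
            index.setdefault(elem, {})[doc_id] = count
    return index


def doc_norms(index):
    """Euclidean length of each document's element-count vector."""
    squares = Counter()
    for postings in index.values():
        for doc_id, count in postings.items():
            squares[doc_id] += count * count
    return {doc_id: math.sqrt(total) for doc_id, total in squares.items()}


def similarity(query, index, norms):
    """Count the query's elements, compare them with the index, and return
    a cosine similarity (an assumed measure) for each matching document."""
    q_counts = Counter(elements(query))
    q_norm = math.sqrt(sum(c * c for c in q_counts.values()))
    dots = Counter()
    for elem, q_count in q_counts.items():
        for doc_id, d_count in index.get(elem, {}).items():
            dots[doc_id] += q_count * d_count
    # Documents sharing no element with the query are simply absent.
    return {doc_id: dots[doc_id] / (q_norm * norms[doc_id]) for doc_id in dots}
```

A caller would build the index once over the registered documents, then call `similarity` per query and sort the returned scores. Including word pairs means two documents that share the same words in a different order score lower than an exact phrase match.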
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.