Method for domain identification of documents in a document database
US7814105B2 · kind B2 · utility
9Cited by
26References
37Claims
0Family size
Assignee
Inventors
Key dates
| Filing date | May 5, 2006 |
| Grant date | Oct 12, 2010 |
| Priority date | — |
| Expiry date | Mar 23, 2027 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/313
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method for processing documents in a document database includes determining vocabulary words for each document, and determining a respective relevancy for each vocabulary word based upon occurrences thereof in all of the documents. Similarities are determined between the documents based upon the vocabulary words and their respective relevancies. At least one domain identification is determined for the documents based upon the determined similarities.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.