Representative document selection for a set of duplicate documents
US8868559B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Aug 30, 2012 |
| Grant date | Oct 21, 2014 |
| Priority date | — |
| Expiry date | Aug 30, 2032 |
Classification
- Technology area (CPC Y)Emerging Cross-Sectional Technologies
- CPC primaryY10S707/99954
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Systems and methods for indexing a representative document from a set of duplicate documents are disclosed. Disclosed systems and methods comprise selecting a first document in a plurality of documents on the basis that the first document is associated with a query independent score. Each respective document in the plurality of documents has a fingerprint that indicates that the respective document has substantially identical content to every other document in the plurality of documents. Disclosed systems and methods further comprise indexing, in accordance with the query independent score, the first document thereby producing an indexed first document. With respect to the plurality of documents, only the indexed first document is included in a document index.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.