Identifying documents which form translated pairs, within a document collection
US7813918B2 · kind B2 · utility
84Cited by
55References
19Claims
0Family size
Assignee
Inventors
Key dates
| Filing date | Aug 3, 2005 |
| Grant date | Oct 12, 2010 |
| Priority date | — |
| Expiry date | Apr 2, 2028 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/45
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A training system for text to text application. The training system finds groups of documents, and identifies automatically similar documents in the groups which are similar. The automatically identified documents can then be used for training of the text to text application. The comparison uses reduced size versions of the documents in order to minimize the amount of processing.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.