Translation of entity names based on source document publication date, and frequency and co-occurrence of the entity names
US8706474B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Apr 25, 2008 |
| Grant date | Apr 22, 2014 |
| Priority date | — |
| Expiry date | Apr 19, 2032 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/55
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Methods and apparatus, including computer program products, related to entity name translation. In general, a set of data records including one or more names of one or more entities in a source language is received and candidate translations are generated for the names of the data records. The candidate translations are strings of characters in a target language that has a different orthography than an orthography of the source language. A data repository is queried for records matching the candidate translations. A translation of at least one of the candidate translations is selected based on a comparison of a combination of candidate translations and properties of the source record comprising of the publication date, entity name co-occurrence and their frequency with the result of the querying the data repository.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.