Domain similarity scores for information retrieval
US10380163B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 14, 2016 |
| Grant date | Aug 13, 2019 |
| Priority date | — |
| Expiry date | Jul 17, 2037 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F16/35
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Various embodiments of systems, computer program products, and methods to provide domain similarity scores for information retrieval are described herein. In an aspect, a plurality of files associated with a plurality of domains are retrieved. A corpus corresponding to the plurality of domains is generated based on the plurality of files by integrating the plurality of files corresponding to the plurality of domains. Further, similarity scores between the plurality of domains are determined based on the generated corpus. The similarity scores between the plurality of domains are persisted in a similarity scores repository to enable information retrieval during translating data between different languages.
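The pipeline in the abstract (retrieve files per domain, integrate them into a corpus, score domain pairs, persist the scores) can be illustrated with a minimal sketch. The abstract does not say which similarity measure is used, so cosine similarity over term counts is an assumption here. The function names, the sample domains, and the in-memory dictionary standing in for the similarity scores repository are also hypothetical.

```python
import math
import re
from collections import Counter
from itertools import combinations


def build_corpus(files_by_domain):
    """Integrate each domain's files into one term-count vector per domain."""
    corpus = {}
    for domain, texts in files_by_domain.items():
        counts = Counter()
        for text in texts:
            # Naive tokenizer (assumption): lowercase alphabetic runs only.
            counts.update(re.findall(r"[a-z]+", text.lower()))
        corpus[domain] = counts
    return corpus


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def similarity_scores(corpus):
    """Score every unordered domain pair; keys are alphabetically sorted tuples."""
    scores = {}
    for left, right in combinations(corpus, 2):
        scores[tuple(sorted((left, right)))] = cosine(corpus[left], corpus[right])
    return scores


# Hypothetical sample data: file contents grouped by domain.
files_by_domain = {
    "finance": ["invoice payment account", "account balance payment"],
    "sales": ["invoice order customer", "customer payment order"],
    "hr": ["employee leave salary"],
}

# A plain dict stands in for the similarity scores repository (assumption).
repository = similarity_scores(build_corpus(files_by_domain))

for pair, score in sorted(repository.items()):
    print(pair, round(score, 3))
```

In a translation setting, a lookup for a term in one domain could fall back to the most similar domain in such a repository. How the scores are actually consumed is not specified in the abstract.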
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.