Corpus search systems and methods
US10102274B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Mar 17, 2014 |
| Grant date | Oct 16, 2018 |
| Priority date | — |
| Expiry date | Nov 5, 2035 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/24578
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A corpus of texts relating to a domain of knowledge may be searched by determining noun-pair proximity scores measuring associations between pairs of nouns that appear in the corpus and that are semantically related to the domain of knowledge. When a search term is received, the noun-pair proximity scores may be used (at least in part) to identify one or more related nouns that are strongly associated with the search term within the corpus. One or more texts may be selected from the corpus, texts in which the search term and the related nouns appear near each other in one or more places. The selected texts may be categorized and/or clustered based on the related nouns before being returned for presentation as SearchResults.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.