Identifying ambiguity in semantic resources
US11379669B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 29, 2019 |
| Grant date | Jul 5, 2022 |
| Priority date | — |
| Expiry date | Apr 9, 2040 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/242
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Embodiments relate to a system, program product, and method for dictionary membership management directed at identifying ambiguity in semantic resources. A dictionary of seed terms is applied to a text corpus and matching items in the corpus are identified. The linguistic properties for each matching item are characterized and a context pattern of each matching item is constructed. Each context pattern is applied to the dictionary and matching content between the seed terms and the context pattern is identified and quantified. Lexicon items from the dictionary that have anomalous behavior reflected in the quantification are identified. One or more seed words identified as having anomalous behavior are selectively removed from the dictionary.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.