Extraction of lexical kernel units from a domain-specific lexicon
US9588959B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 9, 2015 |
| Grant date | Mar 7, 2017 |
| Priority date | — |
| Expiry date | Feb 13, 2035 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F40/35
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
According to an aspect, a candidate lexical kernel unit that includes a word token sequence having two or more words is received. Domain terms that contain the two or more words are retrieved from a terminology resource file of domain terms associated with a domain. The candidate lexical kernel unit and the retrieved domain terms are analyzed to determine whether the candidate lexical kernel unit satisfies specified criteria for use as a building block by a natural-language processing (NLP) tool for building larger lexical units in the domain. Each of the larger lexical units includes a greater number of words than the candidate lexical kernel unit. The candidate lexical kernel unit is identified as a lexical kernel unit based on determining that the candidate lexical kernel unit satisfies the specified criteria. The lexical kernel unit is output to a domain-specific lexical kernel unit file for input to the NLP tool.
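The flow described in the abstract (receive a candidate, retrieve domain terms containing its words, test the candidate against criteria, write accepted units to a file) can be illustrated with a minimal Python sketch. The abstract does not state what the "specified criteria" are, so the rule used here is purely an assumption for illustration: a candidate is accepted when it occurs as a contiguous word sequence inside at least `min_support` retrieved domain terms that are longer than the candidate. All function names, the whitespace tokenizer, the `min_support` parameter, and the one-unit-per-line output format are hypothetical and are not taken from the patent.

```python
from typing import Iterable, List, Sequence


def tokenize(term: str) -> List[str]:
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return term.lower().split()


def retrieve_terms(candidate: Sequence[str], domain_terms: Iterable[str]) -> List[List[str]]:
    """Return the tokenized domain terms that contain every word of the candidate."""
    wanted = {w.lower() for w in candidate}
    tokenized = (tokenize(t) for t in domain_terms)
    return [toks for toks in tokenized if wanted <= set(toks)]


def contains_contiguous(tokens: List[str], candidate: List[str]) -> bool:
    """True if the candidate appears in order and without gaps inside tokens."""
    n = len(candidate)
    return any(tokens[i:i + n] == candidate for i in range(len(tokens) - n + 1))


def is_kernel_unit(candidate: Sequence[str], domain_terms: Iterable[str],
                   min_support: int = 2) -> bool:
    """Apply the ASSUMED criterion: enough larger terms embed the candidate."""
    cand = [w.lower() for w in candidate]
    if len(cand) < 2:  # the abstract requires two or more words
        return False
    retrieved = retrieve_terms(cand, domain_terms)
    support = sum(
        1 for toks in retrieved
        if len(toks) > len(cand) and contains_contiguous(toks, cand)
    )
    return support >= min_support


def write_kernel_units(candidates: Iterable[Sequence[str]], domain_terms: List[str],
                       out_path: str, min_support: int = 2) -> List[str]:
    """Write accepted units, one per line, to a hypothetical kernel unit file."""
    kernels = [
        " ".join(w.lower() for w in c)
        for c in candidates
        if is_kernel_unit(c, domain_terms, min_support)
    ]
    with open(out_path, "w", encoding="utf-8") as fh:
        for unit in kernels:
            fh.write(unit + "\n")
    return kernels
```

With a toy terminology list such as `["chronic heart failure", "congestive heart failure", "heart failure", "failure of the heart", "renal failure"]`, the candidate `("heart", "failure")` is accepted under this assumed rule because two longer terms embed it contiguously, while `("renal", "failure")` is rejected because no longer term contains it. A real criterion could just as well weigh frequency, part-of-speech patterns, or other evidence; the abstract leaves that open.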
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.