Linked data seeded multi-lingual lexicon extraction
US11163952B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 11, 2018 |
| Grant date | Nov 2, 2021 |
| Priority date | — |
| Expiry date | Mar 29, 2039 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/284
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
One embodiment provides a method for relevant language-independent terminology extraction from content, the method including extracting lexicon items from the content based on context extraction patterns using statistical processing. Feedback on the extracted lexicon items is received to automatically tune scores and thresholds for the context extraction patterns. Available Linked Data is leveraged for a bootstrap source. The relevant language-independent terminology extraction is bootstrapped using the bootstrap source.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.