Apparatus for automatic theme detection from unstructured data
US10372741B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 1, 2013 |
| Grant date | Aug 6, 2019 |
| Priority date | — |
| Expiry date | Aug 19, 2033 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/30
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
This apparatus provides a system and method of determining significant repeating themes in a collection of documents. The apparatus operates unsupervised and leverages a natural language processing mechanism supported with lexicon, synonym and taxonomy dictionaries to determine themes and establish their relevance using a two-level hierarchical structure. The apparatus also assigns meaningful names to identified themes and determines a set of rules that describe the theme such that it can be applied to categorize other documents outside of the collection as well.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.