Patent · US Active

Apparatus for automatic theme detection from unstructured data

US10372741B2 · kind B2 · utility

6Cited by
72References
41Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 1, 2013
Grant dateAug 6, 2019
Priority date
Expiry dateAug 19, 2033

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/30
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

This apparatus provides a system and method of determining significant repeating themes in a collection of documents. The apparatus operates unsupervised and leverages a natural language processing mechanism supported with lexicon, synonym and taxonomy dictionaries to determine themes and establish their relevance using a two-level hierarchical structure. The apparatus also assigns meaningful names to identified themes and determines a set of rules that describe the theme such that it can be applied to categorize other documents outside of the collection as well.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.