Patent · US Active

Text extraction module for contextual analysis engine

US10235681B2 · kind B2 · utility

18Cited by
9References
21Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 15, 2013
Grant dateMar 19, 2019
Priority date
Expiry dateOct 15, 2033

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/237
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A contextual analysis engine systematically extracts, analyzes and organizes digital content stored in an electronic file such as a webpage. Content can be extracted using a text extraction module which is capable of separating the content which is to be analyzed from less meaningful content such as format specifications and programming scripts. The resulting unstructured corpus of plain text can then be passed to a text analytics module capable of generating a structured categorization of topics included within the content. This structured categorization can be organized based on a content topic ontology which may have been previously defined or which may be developed in real-time. The systems disclosed herein optionally include an input/output interface capable of managing workflows of the text extraction module and the text analytics module, administering a cache of previously generated results, and interfacing with other applications that leverage the disclosed contextual analysis services.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.