Text extraction module for contextual analysis engine
US10235681B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 15, 2013 |
| Grant date | Mar 19, 2019 |
| Priority date | — |
| Expiry date | Oct 15, 2033 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/237
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A contextual analysis engine systematically extracts, analyzes and organizes digital content stored in an electronic file such as a webpage. Content can be extracted using a text extraction module which is capable of separating the content which is to be analyzed from less meaningful content such as format specifications and programming scripts. The resulting unstructured corpus of plain text can then be passed to a text analytics module capable of generating a structured categorization of topics included within the content. This structured categorization can be organized based on a content topic ontology which may have been previously defined or which may be developed in real-time. The systems disclosed herein optionally include an input/output interface capable of managing workflows of the text extraction module and the text analytics module, administering a cache of previously generated results, and interfacing with other applications that leverage the disclosed contextual analysis services.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.