System and method for extracting information from unstructured text
US10002129B1 · kind B1 · utility
Assignee
Inventor
Key dates
| Filing date | Mar 30, 2017 |
| Grant date | Jun 19, 2018 |
| Priority date | — |
| Expiry date | Mar 30, 2037 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/211
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
This disclosure relates generally to natural language processing, and more particularly to a system and method for extracting subject-verb-object (SVO) chunked text from an unstructured text. In one embodiment, a method is provided for extracting SVO chunked text from an unstructured text. The method comprises identifying a plurality of part of speech (PoS) tokens in the unstructured text, and determining a plurality of SVO chunked text directly from the plurality of PoS tokens using a machine learning chunker model. The machine learning chunker model is trained on a subject-verb-object (SVO) annotated training data.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.