Systems and methods for information extraction from text documents with spatial context
US11106906B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 9, 2019 |
| Grant date | Aug 31, 2021 |
| Priority date | — |
| Expiry date | Dec 9, 2039 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/10
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Performing information extraction from an electronic document is disclosed. A method comprises: receiving a semi-structured input document; retrieving an entity model that provides one or more domain variable definitions for one or more domain variables, wherein the entity model and the input document correspond to a common domain; determining that the input document includes an entity that satisfies a first domain variable definition corresponding to a first domain variable; retrieving a relational model that provides, for the first domain variable, one or more relational definitions comprising spatial restrictions for one or more values corresponding to the first domain variable; extracting one or more data elements from the input document that satisfy the one or more relational definitions; and generating an information graph having a structured data format, wherein the one or more data elements extracted from the input document correspond to the first domain variable in the structured data format.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.