Patent · US Active

Systems and methods for information extraction from text documents with spatial context

US11106906B2 · kind B2 · utility

4Cited by
1References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 9, 2019
Grant dateAug 31, 2021
Priority date
Expiry dateDec 9, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/10
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Performing information extraction from an electronic document is disclosed. A method comprises: receiving a semi-structured input document; retrieving an entity model that provides one or more domain variable definitions for one or more domain variables, wherein the entity model and the input document correspond to a common domain; determining that the input document includes an entity that satisfies a first domain variable definition corresponding to a first domain variable; retrieving a relational model that provides, for the first domain variable, one or more relational definitions comprising spatial restrictions for one or more values corresponding to the first domain variable; extracting one or more data elements from the input document that satisfy the one or more relational definitions; and generating an information graph having a structured data format, wherein the one or more data elements extracted from the input document correspond to the first domain variable in the structured data format.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.