Patent · US Active

Automated extraction of unstructured tables and semantic information from arbitrary documents

US10878195B2 · kind B2 · utility

13Cited by
12References
20Claims
0Family size

Assignee

Inventor

Key dates

Filing dateMay 3, 2018
Grant dateDec 29, 2020
Priority date
Expiry dateJan 9, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/10
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A “Table Extractor” provides various techniques for automatically delimiting and extracting tables from arbitrary documents. In various implementations, the Table extractor also generates functional relationships on those tables that are suitable for generating query responses via any of a variety of natural language processing techniques. In other words, the Table Extractor provides techniques for detecting and representing table information in a way suitable for information extraction. These techniques output relational functions on the table in the form of tuples constructed from automatically identified headers and labels and the relationships between those headers and labels and the contents of one or more cells of the table. These tuples are suitable for correlating natural language questions about a specific piece of information in the table with the rows, columns, and/or cells that contain that information.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.