Automated extraction of unstructured tables and semantic information from arbitrary documents
US10878195B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | May 3, 2018 |
| Grant date | Dec 29, 2020 |
| Priority date | — |
| Expiry date | Jan 9, 2039 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06V30/10
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A “Table Extractor” provides various techniques for automatically delimiting and extracting tables from arbitrary documents. In various implementations, the Table Extractor also generates functional relationships on those tables that are suitable for generating query responses via any of a variety of natural language processing techniques. In other words, the Table Extractor provides techniques for detecting and representing table information in a way suitable for information extraction. These techniques output relational functions on the table in the form of tuples constructed from automatically identified headers and labels and the relationships between those headers and labels and the contents of one or more cells of the table. These tuples are suitable for correlating natural language questions about a specific piece of information in the table with the rows, columns, and/or cells that contain that information.
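As a rough illustration of the tuple representation described in the abstract, the sketch below turns an already-delimited table into (row label, column header, value) tuples and matches a question against them. It is not the patented method: the names `CellTuple`, `table_to_tuples`, and `lookup` are hypothetical, the sample table is invented for illustration, the code assumes the first row holds column headers and the first column holds row labels, and plain substring matching stands in for the natural language processing the abstract mentions.

```python
from typing import List, NamedTuple


class CellTuple(NamedTuple):
    """One relation: (row label, column header) -> cell value. Hypothetical type."""
    row_label: str
    column_header: str
    value: str


def table_to_tuples(rows: List[List[str]]) -> List[CellTuple]:
    """Build tuples from a table given as a list of rows.

    Assumption: rows[0] holds the column headers and the first cell of
    every later row is that row's label. Real header and label detection
    is out of scope for this sketch.
    """
    header = rows[0]
    tuples = []
    for row in rows[1:]:
        label = row[0]
        for column_header, cell in zip(header[1:], row[1:]):
            tuples.append(CellTuple(label, column_header, cell))
    return tuples


def lookup(tuples: List[CellTuple], question: str) -> List[str]:
    """Return values whose row label and column header both occur in the question.

    Naive case-insensitive substring matching, used here only as a
    placeholder for a proper question-to-tuple correlation step.
    """
    q = question.lower()
    return [
        t.value
        for t in tuples
        if t.row_label.lower() in q and t.column_header.lower() in q
    ]


# Invented sample table, for illustration only.
table = [
    ["Region", "Revenue", "Headcount"],
    ["North", "120", "14"],
    ["South", "95", "11"],
]

tuples = table_to_tuples(table)
answer = lookup(tuples, "What was the revenue for the South region?")
print(answer)
```

Storing each cell with both its row label and its column header is what lets a question that names both be mapped back to a single cell, which is the correlation the abstract describes.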
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.