Method and system of extracting label:value data from a document
US9613267B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 3, 2014 |
| Grant date | Apr 4, 2017 |
| Priority date | — |
| Expiry date | Dec 21, 2034 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/416
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
This disclosure provides an exemplary method and system for extracting structured label and value pairwise textual data from a textual document. According to an exemplary method, initially a layout analysis is performed resulting in one or more alternatives for grouping and ordering the textual elements of interest. Next, textual elements are tagged as including a label term, a value term or a label and value term. Finally, a sequence-based method is applied to the tagged elements to generate one or more sequence listings representative of the label and value pairwise data structure(s) and label:value pairwise data is extracted.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.