Patent · US Active

Method and system of extracting label:value data from a document

US9613267B2 · kind B2 · utility

Cited by: 9
References: 9
Claims: 25
Family size: 0

Assignee

Inventors

Key dates

Filing date: Sep 3, 2014
Grant date: Apr 4, 2017
Priority date:
Expiry date: Dec 21, 2034

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/416
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

This disclosure provides an exemplary method and system for extracting structured label and value pairwise textual data from a textual document. According to an exemplary method, initially a layout analysis is performed resulting in one or more alternatives for grouping and ordering the textual elements of interest. Next, textual elements are tagged as including a label term, a value term or a label and value term. Finally, a sequence-based method is applied to the tagged elements to generate one or more sequence listings representative of the label and value pairwise data structure(s) and label:value pairwise data is extracted.
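
The three stages named in the abstract (layout analysis, tagging, sequence-based extraction) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the `Element` type, the `LABEL_TERMS` vocabulary, the row tolerance, and the helper names are all hypothetical. The layout step produces a single row-major ordering rather than several alternatives, and a simple left-to-right scan stands in for whatever sequence-based method the claims actually describe.

```python
from dataclasses import dataclass


@dataclass
class Element:
    """A textual element with the position of its top-left corner (hypothetical type)."""
    text: str
    x: float
    y: float


# Hypothetical label vocabulary; a real system would likely learn or configure this.
LABEL_TERMS = {"invoice no", "date", "total"}


def row_major_order(elements, row_tol=5.0):
    """Layout step: group elements into rows by y, then order each row left to right."""
    rows = []
    for el in sorted(elements, key=lambda e: (e.y, e.x)):
        if rows and abs(rows[-1][0].y - el.y) <= row_tol:
            rows[-1].append(el)
        else:
            rows.append([el])
    return [el for row in rows for el in sorted(row, key=lambda e: e.x)]


def tag(el):
    """Tagging step: 'L' (label), 'V' (value), or 'LV' (label and value in one element)."""
    head, sep, tail = el.text.partition(":")
    is_label = head.strip().lower() in LABEL_TERMS
    if is_label and sep and tail.strip():
        return "LV"
    return "L" if is_label else "V"


def extract_pairs(elements):
    """Sequence step: scan the tagged sequence and pair each label with the next value."""
    pairs = []
    pending_label = None
    for el in row_major_order(elements):
        kind = tag(el)
        if kind == "LV":
            head, _, tail = el.text.partition(":")
            pairs.append((head.strip(), tail.strip()))
            pending_label = None
        elif kind == "L":
            pending_label = el.text.rstrip(":").strip()
        elif pending_label is not None:
            pairs.append((pending_label, el.text.strip()))
            pending_label = None
    return pairs


elements = [
    Element("Total", 10, 50),
    Element("42.00", 100, 51),
    Element("Invoice No: 123", 10, 10),
    Element("Date:", 10, 30),
    Element("2014-09-03", 100, 30),
]
pairs = extract_pairs(elements)
print(pairs)
```

In this sketch a value with no preceding label is dropped, and a label followed directly by another label is overwritten. Those are simplifications of the sketch, not behaviour described in the abstract.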

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.