Patent · US Active

Building classification and extraction models based on electronic forms

US10140511B2 · kind B2 · utility

11Cited by
5References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 30, 2016
Grant dateNov 27, 2018
Priority date
Expiry dateApr 30, 2037

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V2201/10
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

According to one embodiment, a computer-implemented method is configured for building a classification and/or data extraction knowledge base using an electronic form. The method includes: receiving an electronic form having associated therewith a plurality of metadata labels, each metadata label corresponding to at least one element of interest represented within the electronic form; parsing the plurality of metadata labels to determine characteristic features of the element(s) of interest; building a representation of the electronic form based on the plurality of metadata labels; generating a plurality of permutations of the representation of the electronic form by applying a predetermined set of variations to the representation; and training either a classification model, an extraction model, or both using: the representation of the electronic form, and the plurality of permutations of the representation of the electronic form. Corresponding systems and computer program products are also disclosed.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.