Patent · US Active

Generating structured representations of forms using machine learning

US11521405B2 · kind B2 · utility

Cited by: 1
References: 9
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Apr 29, 2021
Grant date: Dec 6, 2022
Priority date
Expiry date: Jul 24, 2041

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N5/022
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method may include acquiring, from an initial document having a document type, initial document elements and initial attributes, deriving initial features for the initial document elements using the initial attributes, detecting initial form components using the initial features, clustering the initial form components into initial line objects of an initial structured representation by applying an unsupervised machine learning model to the geometric attributes of the initial document elements, acquiring, from a next document having the document type, next document elements and next attributes describing the next document elements, deriving next features for the next document elements using the next attributes, detecting next form components using the next features, determining that the initial form components and the next form components are different, clustering the next form components into next line objects of a next structured representation, and replacing the initial structured representation with the next structured representation.
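
The clustering and replacement steps in the abstract can be illustrated with a minimal sketch. The abstract does not say which unsupervised model is used, so this example assumes a simple one-dimensional single-linkage clustering of vertical centers. The names FormComponent, cluster_into_lines, update_representation, and the max_gap threshold are all hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class FormComponent:
    """Hypothetical record: a detected form component and its geometry."""
    label: str
    x: float       # left edge of the bounding box
    y: float       # top edge of the bounding box
    height: float  # bounding-box height

    @property
    def y_center(self) -> float:
        return self.y + self.height / 2


def cluster_into_lines(components: List[FormComponent],
                       max_gap: float = 5.0) -> List[List[FormComponent]]:
    """Group components into line objects.

    Assumed technique: 1-D single-linkage clustering. Components are sorted
    by vertical center, and a new line starts whenever the gap to the
    previous component exceeds max_gap.
    """
    ordered = sorted(components, key=lambda c: c.y_center)
    lines: List[List[FormComponent]] = []
    for comp in ordered:
        if lines and comp.y_center - lines[-1][-1].y_center <= max_gap:
            lines[-1].append(comp)
        else:
            lines.append([comp])
    # Within each line, order components left to right.
    return [sorted(line, key=lambda c: c.x) for line in lines]


def update_representation(current_lines: List[List[FormComponent]],
                          current_components: List[FormComponent],
                          next_components: List[FormComponent],
                          max_gap: float = 5.0) -> List[List[FormComponent]]:
    """Replace the structured representation only if the components differ."""
    if set(current_components) == set(next_components):
        return current_lines
    return cluster_into_lines(next_components, max_gap)


# Initial document: a label and field on one line, a lone label on the next.
initial = [
    FormComponent("Name", 10, 100, 12),
    FormComponent("name_field", 80, 101, 12),
    FormComponent("Date", 10, 130, 12),
]
initial_lines = cluster_into_lines(initial)

# Next document of the same type: one extra field appears on the second line.
nxt = initial + [FormComponent("date_field", 80, 129, 12)]
next_lines = update_representation(initial_lines, initial, nxt)

print([[c.label for c in line] for line in next_lines])
```

In this sketch an unchanged set of components leaves the existing representation in place, while any difference triggers re-clustering and replacement, mirroring the order of steps in the abstract. A real system would likely cluster on richer geometric attributes than vertical position alone.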

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.