Generating structured representations of forms using machine learning
US11521405B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Apr 29, 2021 |
| Grant date | Dec 6, 2022 |
| Priority date | — |
| Expiry date | Jul 24, 2041 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06N5/022
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method may include acquiring, from an initial document having a document type, initial document elements and initial attributes, deriving initial features for the initial document elements using the initial attributes, detecting initial form components using the initial features, clustering the initial form components into initial line objects of an initial structured representation by applying an unsupervised machine learning model to the geometric attributes of the initial document elements, acquiring, from a next document having the document type, next document elements and next attributes describing the next document elements, deriving next features for the next document elements using the next attributes, detecting next form components using the next features, determining that the initial form components and the next form components are different, clustering the next form components into next line objects of a next structured representation, and replacing the initial structured representation with the next structured representation.
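The flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the names `Component`, `StructuredRepresentation`, `cluster_into_lines`, `update_representation`, and the `max_gap` threshold are all hypothetical, and the abstract's unspecified unsupervised model is swapped here for a simple one-dimensional single-linkage clustering on vertical position. The sketch assumes form components have already been detected and carry geometric attributes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Component:
    """Hypothetical detected form component with geometric attributes."""
    kind: str   # e.g. "label", "text_field", "checkbox"
    text: str
    x: float    # left edge
    y: float    # vertical center


@dataclass
class StructuredRepresentation:
    """Hypothetical structured representation: components grouped into lines."""
    components: frozenset
    lines: list


def cluster_into_lines(components, max_gap=5.0):
    """Group components into line objects.

    Assumption: single-linkage clustering on the y coordinate stands in for
    the unsupervised model. A new line starts whenever the vertical gap to
    the previous component exceeds max_gap.
    """
    lines = []
    prev_y = None
    for comp in sorted(components, key=lambda c: c.y):
        if prev_y is None or comp.y - prev_y > max_gap:
            lines.append([])
        lines[-1].append(comp)
        prev_y = comp.y
    # Order each line left to right.
    return [sorted(line, key=lambda c: c.x) for line in lines]


def build_representation(components, max_gap=5.0):
    return StructuredRepresentation(
        components=frozenset(components),
        lines=cluster_into_lines(components, max_gap),
    )


def update_representation(stored, next_components, max_gap=5.0):
    """Keep the stored representation unless the next document's components
    differ, in which case replace it with a newly clustered one."""
    if stored.components == frozenset(next_components):
        return stored
    return build_representation(next_components, max_gap)


# Initial document of a given type.
initial = [
    Component("label", "Name", 10, 100),
    Component("text_field", "", 80, 101.5),
    Component("label", "Date", 10, 130),
]
rep = build_representation(initial)

# Next document of the same type, with one added component.
revised = initial + [Component("checkbox", "Agree", 10, 160)]
new_rep = update_representation(rep, revised)
```

In this sketch, equality of component sets is the stand-in for "determining that the initial form components and the next form components are different"; a real system would likely tolerate small geometric jitter rather than require exact matches.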
Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.