Patent · US Active

Grammar-based automated generation of annotated synthetic form training data for machine learning

US10970530B1 · kind B1 · utility

5Cited by
0References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 13, 2018
Grant dateApr 6, 2021
Priority date
Expiry dateJan 4, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/901
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Techniques for grammar-based automated generation of annotated synthetic form training data for machine learning are described. A training data generation engine utilizes a defined grammar to construct a layout for a form, select key-value units to place within the layout, and select attribute variants for the key-value units. The form is rendered and stored at a storage location, where it can be provided along with other similarly-generated forms to be used as training data for a machine learning model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.