Document digitization, transformation and validation
US11899727B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Aug 30, 2021 |
| Grant date | Feb 13, 2024 |
| Priority date | — |
| Expiry date | Feb 9, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/58
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
An Artificial Intelligence (AI) based document digitization, transformation and validation system extracts fields from digital documents via different document digitization processes. A document packet with a plurality of documents is initially accessed and any non-digital documents in the document packet are digitized. The errors in the digitized documents are corrected and non-English documents are translated into English. Each of the documents is provided to a plurality of digitization services for the extraction of fields by a plurality of field extraction models. If a field has multiple field instances extracted by more than one digitization service, then a field instance with the highest confidence score is selected for inclusion into the consolidated results. The consolidated results produced in different JavaScript Object Notation (JSON) formats are converted into a common JSON format which may be further validated and provided to downstream processes.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.