System and method for automatic detection and verification of optical character recognition data
US11232300B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 21, 2019 |
| Grant date | Jan 25, 2022 |
| Priority date | — |
| Expiry date | May 16, 2040 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06V2201/01
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Systems and methods for automatically verifying optical character recognition (OCR) detected text of a native electronic document having an image layer comprising a matrix of pixels and a text layer comprising a sequence of characters. The method includes determining a location of OCR-detected text in the text layer of the native electronic document based on a pixel-based coordinate location of the OCR-detected text in the image layer of the native electronic document. The method also includes applying the location of the OCR-detected text to the text layer of the native electronic document to detect text in the text layer corresponding to the OCR-detected text. The method also includes rendering only the detected text in the text layer as an output when the OCR-detected text does not match the detected text in the text layer, to improve accuracy of the output text.
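The verification flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `TextSpan`, `pixel_box_to_text_layer`, `text_at`, and `verify` are hypothetical, and so are three assumptions. The first is that the text layer uses 72 units per inch. The second is that lookup is done by simple bounding-box overlap. The third is that the OCR text is kept when the text layer has nothing at that location.

```python
from dataclasses import dataclass
from typing import List, Tuple

# (x0, y0, x1, y1); origin and axis direction are assumed identical in both layers.
Box = Tuple[float, float, float, float]


@dataclass
class TextSpan:
    """Hypothetical text-layer entry: a run of characters and its bounding box."""
    text: str
    box: Box  # in text-layer units (assumed 72 per inch)


def pixel_box_to_text_layer(box_px: Box, dpi: float,
                            units_per_inch: float = 72.0) -> Box:
    """Convert a pixel-based box from the image layer into text-layer units."""
    scale = units_per_inch / dpi
    x0, y0, x1, y1 = box_px
    return (x0 * scale, y0 * scale, x1 * scale, y1 * scale)


def overlaps(a: Box, b: Box) -> bool:
    """True when two axis-aligned boxes share any interior area."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def text_at(spans: List[TextSpan], region: Box) -> str:
    """Collect text-layer spans overlapping the region, in reading order."""
    hits = [s for s in spans if overlaps(s.box, region)]
    hits.sort(key=lambda s: (s.box[1], s.box[0]))  # top-to-bottom, left-to-right
    return " ".join(s.text for s in hits)


def verify(ocr_text: str, ocr_box_px: Box,
           spans: List[TextSpan], dpi: float) -> str:
    """Return the text-layer text when it disagrees with the OCR result."""
    region = pixel_box_to_text_layer(ocr_box_px, dpi)
    layer_text = text_at(spans, region)
    if layer_text and layer_text != ocr_text:
        return layer_text  # mismatch: output only the text-layer text
    return ocr_text  # match, or nothing found (fallback is an assumption)


# Example: OCR misreads zeros as the letter O on a 300 DPI page image.
spans = [
    TextSpan("Invoice", (72.0, 72.0, 120.0, 84.0)),
    TextSpan("10045", (125.0, 72.0, 160.0, 84.0)),
]
result = verify("Invoice 1OO45", (300.0, 300.0, 670.0, 350.0), spans, dpi=300.0)
```

In this example the OCR string differs from the text found at the mapped location, so `result` holds the text-layer string rather than the OCR reading. A production system would likely need tolerance for whitespace and partial overlaps, which this sketch leaves out.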
Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.