System and method for automatic detection and verification of optical character recognition data
US10489645B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 15, 2018 |
| Grant date | Nov 26, 2019 |
| Priority date | — |
| Expiry date | Jun 19, 2038 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06V30/10
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Methods for automatically verifying text detected by optical character recognition (OCR). The method includes obtaining a native digital document having an image layer comprising a matrix of computer-renderable pixels and a text layer comprising computer-readable encodings of a sequence of characters. The method includes obtaining OCR-detected text from the image layer of the native digital document and a pixel-based coordinate location of the OCR-detected text in the image layer of the native digital document. The method includes determining, using a pixel transformation, a computer-interpretable location of the OCR-detected text in the text layer of the native digital document. The method includes detecting text in the text layer based on the computer-interpretable location of the OCR-detected text in the text layer. The method includes rendering only the detected text in the text layer when the OCR-detected text does not match the detected text in the text layer.
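The verification flow in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not specify the pixel transformation, so the sketch assumes a simple scale from image pixels (top-left origin) to 72-per-inch text-layer units (bottom-left origin, as in PDF user space). The names `Box`, `TextSpan`, `pixel_box_to_text_layer`, `text_in_region`, and `verify_ocr` are hypothetical, and the exact-string comparison stands in for whatever matching rule the patent actually claims.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    x0: float
    y0: float
    x1: float
    y1: float


@dataclass(frozen=True)
class TextSpan:
    text: str
    box: Box  # text-layer units, origin at bottom-left (assumption)


def pixel_box_to_text_layer(box_px: Box, dpi: float, page_height_pts: float) -> Box:
    """Assumed pixel transformation: scale by 72/dpi and flip the y axis."""
    scale = 72.0 / dpi
    return Box(
        x0=box_px.x0 * scale,
        y0=page_height_pts - box_px.y1 * scale,
        x1=box_px.x1 * scale,
        y1=page_height_pts - box_px.y0 * scale,
    )


def text_in_region(spans: list[TextSpan], region: Box) -> str:
    """Join the text of every span whose centre lies inside the region."""
    hits = []
    for span in spans:
        cx = (span.box.x0 + span.box.x1) / 2
        cy = (span.box.y0 + span.box.y1) / 2
        if region.x0 <= cx <= region.x1 and region.y0 <= cy <= region.y1:
            hits.append(span)
    hits.sort(key=lambda s: s.box.x0)  # left-to-right reading order
    return " ".join(s.text for s in hits)


def verify_ocr(ocr_text: str, ocr_box_px: Box, spans: list[TextSpan],
               dpi: float, page_height_pts: float) -> tuple[bool, str]:
    """Return (matches, text_to_render); on mismatch the text layer wins."""
    region = pixel_box_to_text_layer(ocr_box_px, dpi, page_height_pts)
    layer_text = text_in_region(spans, region)
    matches = ocr_text.strip() == layer_text.strip()
    return matches, (ocr_text if matches else layer_text)


# Hypothetical page: 792 units tall, image layer rendered at 144 dpi.
spans = [
    TextSpan("Invoice", Box(110, 725, 190, 740)),
    TextSpan("Total", Box(110, 100, 150, 112)),
]
ocr_box = Box(200, 100, 400, 140)  # pixel coordinates reported by OCR
print(verify_ocr("lnvoice", ocr_box, spans, dpi=144, page_height_pts=792))
```

In this sketch a misread such as "lnvoice" is replaced by the text-layer string found at the mapped location, which mirrors the abstract's final step of rendering only the text-layer text when the two disagree.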
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.