Patent · US Active

System and method for automatic detection and verification of optical character recognition data

US10489645B2 · kind B2 · utility

Cited by: 10
References: 1
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Mar 15, 2018
Grant date: Nov 26, 2019
Priority date
Expiry date: Jun 19, 2038

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/10
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Methods for automatically verifying text detected by optical character recognition (OCR). The method includes obtaining a native digital document having an image layer comprising a matrix of computer-renderable pixels and a text layer comprising computer-readable encodings of a sequence of characters. The method includes obtaining OCR-detected text from the image layer of the native digital document and a pixel-based coordinate location of the OCR-detected text in the image layer of the native digital document. The method includes determining, using a pixel transformation, a computer-interpretable location of the OCR-detected text in the text layer of the native digital document. The method includes detecting text in the text layer based on the computer-interpretable location of the OCR-detected text in the text layer. The method includes rendering only the detected text in the text layer when the OCR-detected text does not match the detected text in the text layer.
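
The steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation, and everything the abstract leaves open is an assumption: the image layer is taken to be rendered at a known DPI with a top-left pixel origin, the text layer is taken to use 72-unit-per-inch coordinates with a bottom-left origin (as PDF user space does), and the names Box, TextRun, pixel_to_text_layer, text_at, and verify are hypothetical. The fallback to the OCR text when nothing is found in the text layer is also an assumption.

```python
from dataclasses import dataclass

POINTS_PER_INCH = 72.0  # assumed text-layer unit (PDF-style points)


@dataclass(frozen=True)
class Box:
    """Axis-aligned rectangle: (x0, y0) is the min corner, (x1, y1) the max."""
    x0: float
    y0: float
    x1: float
    y1: float

    def overlaps(self, other: "Box") -> bool:
        return (self.x0 < other.x1 and other.x0 < self.x1
                and self.y0 < other.y1 and other.y0 < self.y1)


@dataclass(frozen=True)
class TextRun:
    """Hypothetical text-layer entry: a string plus its box in points."""
    text: str
    box: Box


def pixel_to_text_layer(px_box: Box, dpi: float, page_height_pt: float) -> Box:
    """Assumed pixel transformation: scale pixels to points and flip the y axis."""
    scale = POINTS_PER_INCH / dpi
    return Box(
        x0=px_box.x0 * scale,
        y0=page_height_pt - px_box.y1 * scale,  # bottom edge after the flip
        x1=px_box.x1 * scale,
        y1=page_height_pt - px_box.y0 * scale,  # top edge after the flip
    )


def text_at(runs: list[TextRun], region: Box) -> str:
    """Join the text-layer runs that overlap the region, in reading order."""
    hits = [r for r in runs if r.box.overlaps(region)]
    hits.sort(key=lambda r: (-r.box.y1, r.box.x0))  # top to bottom, left to right
    return " ".join(r.text for r in hits)


def verify(ocr_text: str, ocr_px_box: Box, runs: list[TextRun],
           dpi: float, page_height_pt: float) -> str:
    """Return the text-layer text on a mismatch, otherwise the OCR text."""
    region = pixel_to_text_layer(ocr_px_box, dpi, page_height_pt)
    layer_text = text_at(runs, region)
    # Assumption: if the text layer has nothing at this location, keep the OCR text.
    if layer_text and layer_text != ocr_text:
        return layer_text
    return ocr_text


# Example: OCR misreads the digit 0 as the letter O on a 792 pt page at 144 DPI.
runs = [
    TextRun("Invoice", Box(100, 720, 150, 744)),
    TextRun("10489", Box(155, 720, 200, 744)),
]
result = verify("Invoice 1O489", Box(200, 100, 400, 140), runs,
                dpi=144, page_height_pt=792)
print(result)
```

In this sketch the text layer is treated as authoritative whenever it disagrees with the OCR output, which mirrors the final step of the abstract. A real document would need a parser to supply the text runs and might need a rotation or crop-box offset in the transformation.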

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.