Patent · US Active

System and method for automatic detection and verification of optical character recognition data

US11232300B2 · kind B2 · utility

0Cited by
18References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 21, 2019
Grant dateJan 25, 2022
Priority date
Expiry dateMay 16, 2040

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V2201/01
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems and methods for automatically verifying optical character recognition (OCR) detected text of a native electronic document having an image layer comprising a matrix of pixels and a text layer comprising a sequence of characters. The method includes determining a location of OCR-detected text in the text layer of the native electronic document based on a pixel-based coordinate location of the OCR-detected text in the image layer of the native electronic document. The method also includes applying the location of the OCR-detected text to the text layer of the native electronic document to detect text in the text layer corresponding to the OCR-detected text. The method also includes rendering only the detected text in the text layer as an output when the OCR-detected text does not match the detected text in the text layer, to improve accuracy of the output text.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.