Patent · US Active

OCR error correction

US10896292B1 · kind B1 · utility

Cited by: 4
References: 0
Claims: 17
Family size: 0

Assignee

Inventor

Key dates

Filing date: Jul 17, 2020
Grant date: Jan 19, 2021
Priority date
Expiry date: Jul 17, 2040

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/10
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Implementations of the disclosure are directed to OCR error correction systems and methods. In some implementations, a method comprises: obtaining, at a computing device, optical character recognition (OCR) text extracted from a document image, the text comprising a token; searching, at the computing device, based on a token bigram determined from the token and a mapping between words in a corpus and a corpus bigram set comprised of unique bigrams from the beginning or ending of the words in the corpus, the corpus for a best word to replace the token; and replacing, at the computing device, the token with the best word.
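
A minimal Python sketch of the lookup the abstract describes may help make the data flow concrete. It is an illustration under stated assumptions, not the patented implementation: the function names (edge_bigrams, build_index, correct_token), the tagging of bigrams as "start" or "end", the use of edit distance to pick the "best word", and the max_distance cutoff are all hypothetical choices that the abstract does not specify.

```python
from collections import defaultdict


def edge_bigrams(word):
    """Return the beginning and ending bigrams of a word, tagged by position.

    Tagging with "start"/"end" is an assumption made for this sketch.
    """
    if len(word) < 2:
        return set()
    return {("start", word[:2]), ("end", word[-2:])}


def build_index(corpus_words):
    """Map each unique edge bigram to the corpus words that contain it."""
    index = defaultdict(set)
    for word in corpus_words:
        for key in edge_bigrams(word):
            index[key].add(word)
    return index


def edit_distance(a, b):
    """Levenshtein distance (assumed ranking metric, not taken from the patent)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                # deletion
                            curr[j - 1] + 1,            # insertion
                            prev[j - 1] + (ca != cb)))  # substitution
        prev = curr
    return prev[-1]


def correct_token(token, index, max_distance=2):
    """Replace an OCR token with the closest corpus word sharing an edge bigram."""
    candidates = set()
    for key in edge_bigrams(token):
        candidates |= index.get(key, set())
    if not candidates or token in candidates:
        return token  # nothing to compare against, or already a corpus word
    # Ties on distance are broken alphabetically so the result is deterministic.
    best = min(candidates, key=lambda w: (edit_distance(token, w), w))
    return best if edit_distance(token, best) <= max_distance else token


index = build_index(["invoice", "involve", "payment", "total"])
print(correct_token("invo1ce", index))
```

Restricting the search to words that share a beginning or ending bigram with the token keeps the candidate set small, so the comparatively expensive distance computation runs on only a handful of words rather than the whole corpus.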

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.