Patent · US Active

OCR error correction

US11663408B1 · kind B1 · utility

Cited by: 0
References: 1
Claims: 20
Family size: 0

Assignee

Inventor

Key dates

Filing date: Dec 17, 2020
Grant date: May 30, 2023
Priority date
Expiry date: Jul 15, 2041

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/10
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Implementations of the disclosure are directed to optical character recognition (OCR) error correction systems and methods. In some implementations, a method comprises: obtaining, at a computing device, OCR text extracted from a document image, the text comprising a token; searching a corpus, at the computing device, for a best word to replace the token, the search being based on (i) a token bigram determined from the token and (ii) a mapping between words in the corpus and a corpus bigram set comprised of unique bigrams from the beginning or ending of the words in the corpus; and replacing, at the computing device, the token with the best word.
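
The lookup described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and several details are assumptions. The abstract does not say how the "best word" is ranked, so the sketch uses Levenshtein edit distance with an alphabetical tie-break. It also assumes whitespace tokenization and case-sensitive matching. All function names (edge_bigrams, build_index, edit_distance, best_word, correct_text) are hypothetical.

```python
from collections import defaultdict


def edge_bigrams(word):
    """Return the leading and trailing two-character bigrams of a word."""
    if len(word) < 2:
        return set()
    return {("start", word[:2]), ("end", word[-2:])}


def build_index(corpus_words):
    """Map each unique edge bigram to the corpus words that carry it."""
    index = defaultdict(set)
    for word in corpus_words:
        for key in edge_bigrams(word):
            index[key].add(word)
    return index


def edit_distance(a, b):
    """Levenshtein distance (assumed ranking metric, not from the source)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def best_word(token, index):
    """Pick the closest corpus word sharing an edge bigram with the token."""
    candidates = set()
    for key in edge_bigrams(token):
        candidates |= index.get(key, set())
    if not candidates:
        return token  # nothing to compare against; leave the token as-is
    return min(candidates, key=lambda w: (edit_distance(token, w), w))


def correct_text(text, index):
    """Replace every whitespace-separated token with its best corpus word."""
    return " ".join(best_word(tok, index) for tok in text.split())


index = build_index(["invoice", "payment", "total", "amount"])
print(correct_text("invo1ce tota1", index))  # prints "invoice total"
```

Indexing words by their edge bigrams means only words that share a beginning or ending with the token are scored, rather than the whole corpus. One consequence is that a token whose first and last bigrams are both corrupted finds no candidates and is left unchanged in this sketch.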

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.