OCR error correction
US11663408B1 · kind B1 · utility
Assignee
Inventor
Key dates
| Filing date | Dec 17, 2020 |
| Grant date | May 30, 2023 |
| Priority date | — |
| Expiry date | Jul 15, 2041 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/10
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Implementations of the disclosure are directed to OCR error correction systems and methods. In some implementations, a method comprises: obtaining, at a computing device, optical character recognition (OCR) text extracted from a document image, the text comprising a token; searching, at the computing device, based on a token bigram determined from the token and a mapping between words in a corpus and a corpus bigram set comprised of unique bigrams from the beginning or ending of the words in the corpus, the corpus for a best word to replace the token; and replacing, at the computing device, the token with the best word.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.