Generating searchable text for documents portrayed in a repository of digital images utilizing orientation and text prediction neural networks
US11645826B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Event | Date |
|---|---|
| Filing date | Sep 14, 2020 |
| Grant date | May 9, 2023 |
| Priority date | — |
| Expiry date | Mar 11, 2041 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06V30/40
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
The present disclosure relates to generating computer searchable text from digital images that depict documents utilizing an orientation neural network and/or text prediction neural network. For example, one or more embodiments detect digital images that depict documents, identify the orientation of the depicted documents, and generate computer searchable text from the depicted documents in the detected digital images. In particular, one or more embodiments train an orientation neural network to identify the orientation of a depicted document in a digital image. Additionally, one or more embodiments train a text prediction neural network to analyze a depicted document in a digital image to generate computer searchable text from the depicted document. By utilizing the identified orientation of the depicted document before analyzing the depicted document with a text prediction neural network, the disclosed systems can efficiently and accurately generate computer searchable text for a digital image that depicts a document.
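The order of operations described in the abstract (detect images that depict documents, identify orientation, then generate searchable text) can be illustrated with a minimal Python sketch. This is not the patented implementation: the three model callables (`depicts_document`, `predict_orientation`, `predict_text`) are hypothetical stand-ins for the trained networks, images are toy lists of pixel rows, and the assumption that orientation is reported as a clockwise correction in multiples of 90 degrees is ours, not the source's.

```python
from typing import Callable, Dict, List

# Toy grayscale image: a list of pixel rows (an assumption for illustration only).
Image = List[List[int]]


def rotate_90_clockwise(image: Image) -> Image:
    """Rotate an image a quarter turn clockwise."""
    return [list(row) for row in zip(*image[::-1])]


def make_upright(image: Image, clockwise_degrees: int) -> Image:
    """Apply the clockwise correction (a multiple of 90) predicted for the image."""
    if clockwise_degrees % 90 != 0:
        raise ValueError("correction must be a multiple of 90 degrees")
    for _ in range((clockwise_degrees // 90) % 4):
        image = rotate_90_clockwise(image)
    return image


def build_search_index(
    images: Dict[str, Image],
    depicts_document: Callable[[Image], bool],   # hypothetical document detector
    predict_orientation: Callable[[Image], int],  # hypothetical orientation network
    predict_text: Callable[[Image], str],         # hypothetical text prediction network
) -> Dict[str, str]:
    """Map image names to searchable text, skipping images without documents."""
    index: Dict[str, str] = {}
    for name, image in images.items():
        if not depicts_document(image):
            continue  # only images that depict documents are processed
        # Orientation is resolved first so the text model sees an upright document.
        upright = make_upright(image, predict_orientation(image))
        index[name] = predict_text(upright)
    return index
```

Passing the models in as callables keeps the sketch independent of any particular framework; real orientation and text prediction networks would replace the stubs.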
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.