Pre-training for scene text detection
US12254707B2 · kind B2 · utility
Assignees
Inventors
Key dates
| Filing date | Sep 28, 2022 |
| Grant date | Mar 18, 2025 |
| Priority date | — |
| Expiry date | Sep 9, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/19173
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Embodiments of the present disclosure relate to a method, device and computer readable storage medium of scene text detection. In the method, a first visual representation of a first image is generated with an image encoding process. A first textual representation of a first text unit in the first image is generated with a text encoding process based on a first plurality of symbols obtained by masking a first symbol of a plurality of symbols in the first text unit. A first prediction of the masked first symbol is determined with a decoding process based on the first visual and textual representations. At least the image encoding process is updating according to at least a first training objective to increase at least similarity of the first prediction and the masked first symbol.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.