Patent · US Active

Pre-training for scene text detection

US12254707B2 · kind B2 · utility

0Cited by
0References
20Claims
0Family size

Assignees

Inventors

Key dates

Filing dateSep 28, 2022
Grant dateMar 18, 2025
Priority date
Expiry dateSep 9, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/19173
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Embodiments of the present disclosure relate to a method, device and computer readable storage medium of scene text detection. In the method, a first visual representation of a first image is generated with an image encoding process. A first textual representation of a first text unit in the first image is generated with a text encoding process based on a first plurality of symbols obtained by masking a first symbol of a plurality of symbols in the first text unit. A first prediction of the masked first symbol is determined with a decoding process based on the first visual and textual representations. At least the image encoding process is updating according to at least a first training objective to increase at least similarity of the first prediction and the masked first symbol.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.