Patent · US Active

Text recognition and localization with deep learning

US10032072B1 · kind B1 · utility

Cited by	46

References	2

Claims	19

Family size	0

Assignee

A9.com, Inc. · US

Inventors

Son Dinh Tran · Mountain View, US
R. Manmatha · San Francisco, US

Key dates

Filing date	Jun 21, 2016
Grant date	Jul 24, 2018
Priority date	—
Expiry date	Jul 22, 2036

Classification

Technology area (CPC G)	Physics
CPC primary	G06V30/414
WIPO field	Audio-visual technology
WIPO sector	Electrical engineering

Abstract

Approaches provide for identifying text represented in image data as well as determining a location or region of the image data that includes the text represented in the image data. For example, a camera of a computing device can be used to capture a live camera view of one or more items. The live camera view can be presented to the user on a display screen of the computing device. An application executing on the computing device or at least in communication with the computing device can analyze the image data of the live camera view to identify text represented in the image data as well as determine locations or regions of the image that include the representations. For example, one such recognition approach includes a region proposal process to generate a plurality of candidate bounding boxes, a region filtering process to determine a subset of the plurality of candidate bounding boxes, a region refining process to refine the bounding box coordinates to more accurately fit the identified text, a text recognizer process to recognize words in the refined bounding boxes, and a post-processing process to suppress overlapping bounding boxes to generate a final set of bounding boxes.
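
The final step named in the abstract, suppressing overlapping bounding boxes, can be illustrated with a short sketch. The abstract does not say how the suppression is performed; the code below assumes a conventional greedy non-maximum suppression over intersection-over-union (IoU), which may differ from the method actually claimed. The function names, the `(x1, y1, x2, y2)` box layout, the per-box confidence scores, and the 0.5 threshold are all illustrative assumptions, not taken from the patent.

```python
from typing import List, Sequence, Tuple

# Assumed box layout: (x1, y1, x2, y2) with x1 < x2 and y1 < y2.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    # Width and height of the overlap, clamped at zero when boxes are disjoint.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def suppress_overlaps(
    boxes: Sequence[Box],
    scores: Sequence[float],
    iou_threshold: float = 0.5,  # hypothetical value, not from the patent
) -> List[int]:
    """Greedy non-maximum suppression; returns indices of the boxes kept."""
    # Visit boxes from highest to lowest score.
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    kept: List[int] = []
    for i in order:
        # Keep a box only if it does not overlap too much with any kept box.
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in kept):
            kept.append(i)
    return kept


if __name__ == "__main__":
    candidate_boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
    candidate_scores = [0.9, 0.8, 0.7]
    print(suppress_overlaps(candidate_boxes, candidate_scores))
```

In this example the second box overlaps the first with an IoU of about 0.68, so it is dropped in favor of the higher-scoring first box, while the third, disjoint box is kept. In a text pipeline like the one the abstract describes, the scores would presumably come from the region filtering or text recognizer stages, though the abstract does not state this.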

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.