Simultaneous tracking and text recognition in video frames
US9064174B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 18, 2012 |
| Grant date | Jun 23, 2015 |
| Priority date | — |
| Expiry date | Apr 30, 2033 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/10
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Architecture that enables optical character recognition (OCR) of text in video frames at the rate at which the frames are received. Additionally, conflation is performed on multiple text recognition results in the frame sequence. The architecture comprises an OCR text recognition engine and a tracker system; the tracker system establishes a common coordinate system in which OCR results from different frames may be compared and/or combined. From a set of sequential video frames, a keyframe is chosen from which the reference coordinate system is established. An estimated transformation from keyframe coordinates to subsequent video frames is computed using the tracker system. When text recognition is completed for any subsequent frame, the result coordinates can be related to the keyframe using the inverse transformation from the processed frame to the reference keyframe. The results can be rendered for viewing as the results are obtained.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.