Patent · US Active

Simultaneous tracking and text recognition in video frames

US9064174B2 · kind B2 · utility

12Cited by

3References

20Claims

0Family size

Assignee

MICROSOFT TECHNOLOGY LICENSING, LLC · US

Inventors

David Nister · Bellevue, US
Frederik Schaffalitzky · Kirkland, US
Michael Grabner · Redmond, US
Matthew S. Ashman · Seattle, US
Milan Vugdelija · Umka, RS
Ivan Stojiljkovic · Umka, RS

Key dates

Filing date	Oct 18, 2012
Grant date	Jun 23, 2015
Priority date	—
Expiry date	Apr 30, 2033

Classification

Technology area (CPC G)Physics
CPC primaryG06V30/10
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Architecture that enables optical character recognition (OCR) of text in video frames at the rate at which the frames are received. Additionally, conflation is performed on multiple text recognition results in the frame sequence. The architecture comprises an OCR text recognition engine and a tracker system; the tracker system establishes a common coordinate system in which OCR results from different frames may be compared and/or combined. From a set of sequential video frames, a keyframe is chosen from which the reference coordinate system is established. An estimated transformation from keyframe coordinates to subsequent video frames is computed using the tracker system. When text recognition is completed for any subsequent frame, the result coordinates can be related to the keyframe using the inverse transformation from the processed frame to the reference keyframe. The results can be rendered for viewing as the results are obtained.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.