Patent · US Expired

Automatic indexing and aligning of audio and text using speech recognition

US5649060A · kind A · utility

563Cited by

17References

8Claims

0Family size

Assignee

International Business Machines Corporation · US

Inventors

Hamed A. Ellozy · Mount Kisco, US
Dimitri Kanevsky · Ossining, US
Michelle Kim · Willow View Lane, US
David Nahamoo · Great Neck, US
Michael A. Picheny · White Plains, US
Wlodek W. Zadrozny · Tarrytown, US

Key dates

Filing date	Oct 23, 1995
Grant date	Jul 15, 1997
Priority date	—
Expiry date	Oct 23, 2015

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/226
WIPO fieldAudio-visual technology
WIPO sectorElectrical engineering

Abstract

A method of automatically aligning a written transcript with speech in video and audio clips. The disclosed technique involves as a basic component an automatic speech recognizer. The automatic speech recognizer decodes speech (recorded on a tape) and produces a file with a decoded text. This decoded text is then matched with the original written transcript via identification of similar words or clusters of words. The results of this matching is an alignment of the speech with the original transcript. The method can be used (a) to create indexing of video clips, (b) for "teleprompting" (i.e. showing the next portion of text when someone is reading from a television screen), or (c) to enhance editing of a text that was dictated to a stenographer or recorded on a tape for its subsequent textual reproduction by a typist.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.