Automatic indexing and aligning of audio and text using speech recognition
US5649060A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Oct 23, 1995 |
| Grant date | Jul 15, 1997 |
| Priority date | — |
| Expiry date | Oct 23, 2015 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L2015/226
- WIPO field: Audio-visual technology
- WIPO sector: Electrical engineering
Abstract
A method of automatically aligning a written transcript with speech in video and audio clips. The disclosed technique involves as a basic component an automatic speech recognizer. The automatic speech recognizer decodes speech (recorded on a tape) and produces a file with a decoded text. This decoded text is then matched with the original written transcript via identification of similar words or clusters of words. The result of this matching is an alignment of the speech with the original transcript. The method can be used (a) to create indexing of video clips, (b) for "teleprompting" (i.e., showing the next portion of text when someone is reading from a television screen), or (c) to enhance editing of a text that was dictated to a stenographer or recorded on a tape for its subsequent textual reproduction by a typist.
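The matching step described in the abstract can be illustrated with a minimal sketch. It is not the patented procedure: it assumes the recognizer output is available as a list of `(word, start_seconds)` pairs (a hypothetical format), and it substitutes Python's standard `difflib.SequenceMatcher` for the patent's own word and word-cluster matching. The function name `align_transcript` is likewise an illustrative choice.

```python
from difflib import SequenceMatcher


def align_transcript(decoded, transcript):
    """Attach recognizer timestamps to words of the written transcript.

    decoded:    list of (word, start_seconds) pairs from a speech
                recognizer (assumed format, not taken from the patent).
    transcript: list of words from the original written transcript.

    Returns a list of (transcript_index, transcript_word, start_seconds)
    for every transcript word that falls inside a run of words shared
    with the decoded text.
    """
    decoded_words = [word.lower() for word, _ in decoded]
    transcript_words = [word.lower() for word in transcript]

    # SequenceMatcher finds runs of identical words common to both
    # sequences; this stands in for "similar words or clusters of words".
    matcher = SequenceMatcher(a=decoded_words, b=transcript_words, autojunk=False)

    anchors = []
    for block in matcher.get_matching_blocks():
        # The final block is a sentinel with size 0 and adds nothing.
        for offset in range(block.size):
            i = block.a + offset  # position in the decoded text
            j = block.b + offset  # position in the written transcript
            anchors.append((j, transcript[j], decoded[i][1]))
    return anchors


# The recognizer misheard "fox" as "socks"; the other words still align.
decoded = [("the", 0.0), ("quick", 0.4), ("brown", 0.8), ("socks", 1.2), ("jumps", 1.6)]
transcript = ["The", "quick", "brown", "fox", "jumps"]

for index, word, start in align_transcript(decoded, transcript):
    print(f"{start:5.1f}s  transcript[{index}] = {word}")
```

Words the recognizer got wrong (here "fox") receive no timestamp in this sketch. An index built from the anchors could interpolate between the neighboring matched words, though that choice is an assumption and is not stated in the abstract.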
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.