Methods and apparatus for automatically synchronizing electronic audio files with electronic text files
US6260011A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Mar 20, 2000 |
| Grant date | Jul 10, 2001 |
| Priority date | — |
| Expiry date | Mar 20, 2020 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L15/08
- WIPO fieldAudio-visual technology
- WIPO sectorElectrical engineering
Abstract
Automated methods and apparatus for synchronizing audio and text data, e.g., in the form of electronic files, representing audio and text expressions of the same work or information are described. A statistical language model is generated from the text data. A speech recognition operation is then performed on the audio data using the generated language model and a speaker independent acoustic model. Silence is modeled as a word which can be recognized. The speech recognition operation produces a time indexed set of recognized words some of which may be silence. The recognized words are globally aligned with the words in the text data. Recognized periods of silence, which correspond to expected periods of silence, and are adjoined by one or more correctly recognized words are identified as points where the text and audio files should be synchronized, e.g., by the insertion of bi-directional pointers. In one embodiment, for a text location to be identified for synchronization purposes, both words which bracket, e.g., precede and follow, the recognized silence must be correctly identified. Pointers, corresponding to identified locations of silence to be used for synchronization purpos…
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.