Patent · US Expired

Methods and apparatus for automatically synchronizing electronic audio files with electronic text files

US6260011A · kind A · utility

295Cited by
10References
36Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 20, 2000
Grant dateJul 10, 2001
Priority date
Expiry dateMar 20, 2020

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L15/08
  • WIPO fieldAudio-visual technology
  • WIPO sectorElectrical engineering

Abstract

Automated methods and apparatus for synchronizing audio and text data, e.g., in the form of electronic files, representing audio and text expressions of the same work or information are described. A statistical language model is generated from the text data. A speech recognition operation is then performed on the audio data using the generated language model and a speaker independent acoustic model. Silence is modeled as a word which can be recognized. The speech recognition operation produces a time indexed set of recognized words some of which may be silence. The recognized words are globally aligned with the words in the text data. Recognized periods of silence, which correspond to expected periods of silence, and are adjoined by one or more correctly recognized words are identified as points where the text and audio files should be synchronized, e.g., by the insertion of bi-directional pointers. In one embodiment, for a text location to be identified for synchronization purposes, both words which bracket, e.g., precede and follow, the recognized silence must be correctly identified. Pointers, corresponding to identified locations of silence to be used for synchronization purpos…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.