Patent · US Expired

Methods and apparatus for performing speech recognition using acoustic models which are improved through an interactive process

US6263308A · kind A · utility

Cited by: 121
References: 9
Claims: 28
Family size: 0

Assignee

Inventors

Key dates

Filing date: Mar 20, 2000
Grant date: Jul 17, 2001
Priority date
Expiry date: Mar 20, 2020

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L15/063
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Automated methods and apparatus are described for synchronizing audio and text data, e.g., in the form of electronic files, that represent audio and text expressions of the same work or information. Also described are automated methods of detecting errors and other discrepancies between the audio and text versions of the same work. A speech recognition operation is first performed on the audio data using a speaker independent acoustic model. The operation produces recognized text together with audio time stamps. The recognized text is compared to the text in the text data to identify correctly recognized words. The acoustic model is then retrained using the correctly recognized text and the corresponding audio segments from the audio data, transforming the initial acoustic model into a speaker trained acoustic model. The retrained acoustic model is then used to perform an additional speech recognition operation on the audio data. The audio and text data are synchronized using the results obtained with the updated acoustic model. In addition, one or more error reports based on the final recognition results are generated showing discrepancies between the recognized words…
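
The comparison step in the abstract (matching recognized words against the known text to find the correctly recognized ones) can be illustrated with a minimal Python sketch. It is not the patented method: it assumes the recognizer output is already available as words with start and end times, and it substitutes the standard library's `difflib.SequenceMatcher` for whatever alignment procedure the patent actually uses. The names `RecognizedWord` and `align_words` are hypothetical, introduced only for this illustration.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass
class RecognizedWord:
    """One word of (assumed) recognizer output with its audio time stamps."""
    word: str
    start: float  # seconds from the beginning of the audio
    end: float


def align_words(recognized, reference):
    """Compare recognized words with the reference text.

    Returns two lists:
      matches       -- (reference_index, start, end) for each word that the
                       recognizer got right; these pairs of text and audio
                       span are the kind of data that could feed retraining
                       and synchronization.
      discrepancies -- (kind, reference_words, recognized_words) for each
                       region where the two word sequences differ, the raw
                       material for an error report.
    """
    ref_tokens = [w.lower() for w in reference]
    rec_tokens = [r.word.lower() for r in recognized]
    # autojunk=False so frequent words are not discarded on long inputs.
    matcher = SequenceMatcher(a=ref_tokens, b=rec_tokens, autojunk=False)

    matches, discrepancies = [], []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            for k in range(i2 - i1):
                rec = recognized[j1 + k]
                matches.append((i1 + k, rec.start, rec.end))
        else:  # "replace", "delete" or "insert"
            discrepancies.append(
                (tag, list(reference[i1:i2]), [r.word for r in recognized[j1:j2]])
            )
    return matches, discrepancies
```

In this sketch the matched entries carry the time stamps that tie each reference word to a span of audio, while the unmatched regions mark places where the audio and text may disagree or where the recognizer simply erred. Telling those two cases apart would need the second recognition pass with the retrained model that the abstract describes, which is outside the scope of this example.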

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.