Patent · US Expired

Methods and apparatus for performing speech recognition using acoustic models which are improved through an interactive process

US6263308A · kind A · utility

Cited by	121

References	9

Claims	28

Family size	0

Assignee

Microsoft Corporation · US

Inventors

David E. Heckerman · Bellevue, US
Fileno A. Alleva · Redmond, US
Robert L. Rounthwaite · Fall City, US
Daniel Rosen · Bellevue, US
Mei-Yuh Hwang · Sammamish, US
Yoram Yaacovi · Redmond, US
John L. Manferdelli · San Francisco, US

Key dates

Filing date	Mar 20, 2000
Grant date	Jul 17, 2001
Priority date	—
Expiry date	Mar 20, 2020

Classification

Technology area (CPC G)	Physics
CPC primary	G10L15/063
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Automated methods and apparatus are described for synchronizing audio and text data, e.g., in the form of electronic files, that represent audio and text expressions of the same work or information. Also described are automated methods of detecting errors and other discrepancies between the audio and text versions of the same work. A speech recognition operation is first performed on the audio data using a speaker-independent acoustic model. The speech recognition operation produces recognized text together with audio time stamps. The recognized text is compared to the text in the text data to identify correctly recognized words. The acoustic model is then retrained using the correctly recognized text and the corresponding audio segments from the audio data, transforming the initial acoustic model into a speaker-trained acoustic model. The retrained acoustic model is then used to perform an additional speech recognition operation on the audio data. The audio and text data are synchronized using the results of the updated acoustic model. In addition, one or more error reports based on the final recognition results are generated showing discrepancies between the recognized words…
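
The comparison step in the abstract (matching recognized text against the reference text to find correctly recognized words, then reporting discrepancies) can be illustrated with a minimal sketch. The abstract does not say which comparison algorithm is used, so this example swaps in Python's standard `difflib.SequenceMatcher` purely as an assumption. The function name `align_words` and the `(word, start_sec, end_sec)` tuple layout are hypothetical, and the acoustic model retraining step is not shown.

```python
from difflib import SequenceMatcher


def align_words(recognized, reference):
    """Compare recognizer output against reference text (illustrative only).

    recognized: list of (word, start_sec, end_sec) tuples, a hypothetical
                layout for recognized words with audio time stamps.
    reference:  list of words from the text version of the work.

    Returns (matched, discrepancies):
      matched:       (reference_index, reference_word, start_sec, end_sec)
                     for every word recognized correctly.
      discrepancies: (tag, recognized_words, reference_words) for every
                     region where the two versions differ.
    """
    # Case-insensitive comparison is an assumption of this sketch.
    rec_words = [word.lower() for word, _, _ in recognized]
    ref_words = [word.lower() for word in reference]

    # autojunk=False disables difflib's popularity heuristic so that
    # frequent words are still eligible for matching on long inputs.
    matcher = SequenceMatcher(a=rec_words, b=ref_words, autojunk=False)

    matched = []
    discrepancies = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            # Correctly recognized words inherit the audio time stamps,
            # which ties each reference word to a position in the audio.
            for k in range(i2 - i1):
                _, start, end = recognized[i1 + k]
                matched.append((j1 + k, reference[j1 + k], start, end))
        else:
            # 'replace', 'delete', or 'insert': a candidate entry
            # for an error report.
            discrepancies.append((tag, rec_words[i1:i2], ref_words[j1:j2]))
    return matched, discrepancies
```

In a pipeline like the one the abstract outlines, the `matched` entries would correspond to the word and audio-segment pairs used for retraining and synchronization, while `discrepancies` would feed the error report. How the patent actually selects or weights those pairs is not stated in the abstract.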

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.