Patent · US Active

Document transcription system training

US8335688B2 · kind B2 · utility

10Cited by

40References

38Claims

0Family size

Assignee

Multimodal Technologies, LLC · US

Inventors

Girija Yegnanarayanan · Raleigh, US
Michael Finke · Hövelhof, DE
Juergen Fritsch · Karlsruhe, DE
Detlef Koll · Pittsburgh, US
Monika Woszczyna · Pittsburgh, US

Key dates

Filing date	Aug 20, 2004
Grant date	Dec 18, 2012
Priority date	—
Expiry date	May 14, 2028

Classification

Technology area (CPC G)Physics
CPC primaryG10L15/26
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal transcript of the spoken audio stream. Such a system may identify text in the non-literal transcript which represents concepts having multiple spoken forms. The system may attempt to identify the actual spoken form in the audio stream which produced the corresponding text in the non-literal transcript, and thereby produce a revised transcript which more accurately represents the spoken audio stream. The revised, and more accurate, transcript may be used to train the acoustic model, thereby producing a better acoustic model than that which would be produced using conventional techniques, which perform training based directly on the original non-literal transcript.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.