Patent · US Active

Model training for automatic speech recognition from imperfect transcription data

US9280969B2 · kind B2 · utility

8Cited by

19References

15Claims

0Family size

Assignee

MICROSOFT TECHNOLOGY LICENSING, LLC · US

Inventors

Jinyu Li · Beijing, CN
Yifan Gong · Sammamish, US
Chaojun Liu · Redmond, US
Kaisheng Yao · Newcastle, US

Key dates

Filing date	Jun 10, 2009
Grant date	Mar 8, 2016
Priority date	—
Expiry date	Aug 16, 2033

Classification

Technology area (CPC G)Physics
CPC primaryG10L15/065
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Techniques and systems for training an acoustic model are described. In an embodiment, a technique for training an acoustic model includes dividing a corpus of training data that includes transcription errors into N parts, and on each part, decoding an utterance with an incremental acoustic model and an incremental language model to produce a decoded transcription. The technique may further include inserting silence between a pair of words into the decoded transcription and aligning an original transcription corresponding to the utterance with the decoded transcription according to time for each part. The technique may further include selecting a segment from the utterance having at least Q contiguous matching aligned words, and training the incremental acoustic model with the selected segment. The trained incremental acoustic model may then be used on a subsequent part of the training data. Other embodiments are described and claimed.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.