Model training for automatic speech recognition from imperfect transcription data
US9280969B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 10, 2009 |
| Grant date | Mar 8, 2016 |
| Priority date | — |
| Expiry date | Aug 16, 2033 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L15/065
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Techniques and systems for training an acoustic model are described. In an embodiment, a technique for training an acoustic model includes dividing a corpus of training data that includes transcription errors into N parts, and on each part, decoding an utterance with an incremental acoustic model and an incremental language model to produce a decoded transcription. The technique may further include inserting silence between a pair of words into the decoded transcription and aligning an original transcription corresponding to the utterance with the decoded transcription according to time for each part. The technique may further include selecting a segment from the utterance having at least Q contiguous matching aligned words, and training the incremental acoustic model with the selected segment. The trained incremental acoustic model may then be used on a subsequent part of the training data. Other embodiments are described and claimed.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.