Method and system for processing parallel context dependent speech recognition results from a single utterance utilizing a context database
US9117453B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Dec 30, 2010 |
| Grant date | Aug 25, 2015 |
| Priority date | — |
| Expiry date | Aug 22, 2032 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2015/228
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method of and system for accurately determining a caller response by processing speech-recognition results and returning that result to a directed-dialog application for further interaction with the caller. Multiple speech-recognition engines are provided that process the caller response in parallel. Returned speech-recognition results comprising confidence-score values and word-score values from each of the speech-recognition engines may be modified based on context information provided by the directed-dialog application and grammars associated with each speech-recognition engine. A context database is used to further reduce or add weight to confidence-score values and word-score values, remove phrases and/or words, and add phrases and/or words to the speech-recognition engine results. In situations where a predefined threshold-confidence-score value is not exceeded, a new dynamic grammar may be created. A set of n-best hypotheses of what the caller uttered is returned to the directed-dialog application.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.