Multimodal speech recognition system
US8355915B2 · kind B2 · utility
Inventor
Key dates
| Filing date | Nov 30, 2007 |
| Grant date | Jan 15, 2013 |
| Priority date | — |
| Expiry date | Sep 9, 2031 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L15/32
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
The disclosure describes an overall system/method for text-input using a multimodal interface with speech recognition. Specifically, pluralities of modes interact with the main speech mode to provide the speech-recognition system with partial knowledge of the text corresponding to the spoken utterance forming the input to the speech recognition system. The knowledge from other modes is used to dynamically change the ASR system's active vocabulary thereby significantly increasing recognition accuracy and significantly reducing processing requirements. Additionally, the speech recognition system is configured using three different system configurations (always listening, partially listening, and push-to-speak) and for each one of those three different user-interfaces are proposed (speak-and-type, type-and-speak, and speak-while-typing). Finally, the overall user-interface of the proposed system is designed such that it enhances existing standard text-input methods; thereby minimizing the behavior change for mobile users.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.