Patent · US Active

Multimodal speech recognition system

US8355915B2 · kind B2 · utility

Cited by: 15
References: 22
Claims: 19
Family size: 0

Inventor

Key dates

Filing date: Nov 30, 2007
Grant date: Jan 15, 2013
Priority date
Expiry date: Sep 9, 2031

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L15/32
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

The disclosure describes a system and method for text input using a multimodal interface with speech recognition. Specifically, a plurality of modes interact with the main speech mode to give the speech recognition system partial knowledge of the text corresponding to the spoken utterance that forms its input. Knowledge from the other modes is used to dynamically change the automatic speech recognition (ASR) system's active vocabulary, thereby significantly increasing recognition accuracy and significantly reducing processing requirements. Additionally, the speech recognition system can be set up in three different system configurations (always listening, partially listening, and push-to-speak), and for each of these, three different user interfaces are proposed (speak-and-type, type-and-speak, and speak-while-typing). Finally, the overall user interface of the proposed system is designed to enhance existing standard text-input methods, thereby minimizing the behavior change required of mobile users.
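
The idea of using partial knowledge from another mode to shrink the active vocabulary can be illustrated with a minimal sketch. It is not the patented implementation. It assumes the other mode is a keypad supplying the first letters of the word, and that the recognizer exposes a per-word acoustic score. The function names, the word list, and the scores are all hypothetical.

```python
from typing import Dict, Iterable, List, Optional


def restrict_vocabulary(vocabulary: Iterable[str], typed_prefix: str) -> List[str]:
    """Keep only the words consistent with the letters typed so far (assumed mode: keypad)."""
    prefix = typed_prefix.lower()
    return sorted(w for w in vocabulary if w.lower().startswith(prefix))


def recognize(acoustic_scores: Dict[str, float], active_vocabulary: List[str]) -> Optional[str]:
    """Return the best-scoring word among the active vocabulary, or None if it is empty.

    The scores stand in for whatever a real recognizer would compute; they are made up here.
    """
    candidates = {w: acoustic_scores[w] for w in active_vocabulary if w in acoustic_scores}
    if not candidates:
        return None
    return max(candidates, key=candidates.get)


# Hypothetical full vocabulary and acoustic scores for one spoken utterance.
VOCAB = ["meeting", "meting", "meat", "beating", "seating", "message"]
SCORES = {
    "meeting": 0.41,
    "beating": 0.44,  # acoustically confusable with "meeting"
    "seating": 0.40,
    "meting": 0.20,
    "meat": 0.10,
    "message": 0.05,
}

# Speech alone: the whole vocabulary is active, so the confusable word can win.
speech_only = recognize(SCORES, list(VOCAB))

# Speech plus typed letters "me": the active vocabulary shrinks before scoring.
active = restrict_vocabulary(VOCAB, "me")
multimodal = recognize(SCORES, active)
```

In this toy setup, typing "me" removes the acoustically confusable candidates from the search, so fewer words need to be scored and the intended word is selected. A real system would apply the same constraint inside the recognizer's search rather than to a flat score table.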

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.