Patent · US Active

Multimodal speech recognition system

US8355915B2 · kind B2 · utility

15Cited by

22References

19Claims

0Family size

Inventor

Ashwin Rao · Kirkland, US

Key dates

Filing date	Nov 30, 2007
Grant date	Jan 15, 2013
Priority date	—
Expiry date	Sep 9, 2031

Classification

Technology area (CPC G)Physics
CPC primaryG10L15/32
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

The disclosure describes an overall system/method for text-input using a multimodal interface with speech recognition. Specifically, pluralities of modes interact with the main speech mode to provide the speech-recognition system with partial knowledge of the text corresponding to the spoken utterance forming the input to the speech recognition system. The knowledge from other modes is used to dynamically change the ASR system's active vocabulary thereby significantly increasing recognition accuracy and significantly reducing processing requirements. Additionally, the speech recognition system is configured using three different system configurations (always listening, partially listening, and push-to-speak) and for each one of those three different user-interfaces are proposed (speak-and-type, type-and-speak, and speak-while-typing). Finally, the overall user-interface of the proposed system is designed such that it enhances existing standard text-input methods; thereby minimizing the behavior change for mobile users.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.