Patent · US Active

System and method for speech understanding via integrated audio and visual based speech recognition

US11017779B2 · kind B2 · utility

0Cited by

12References

15Claims

0Family size

Assignee

DMAI, INC. · US

Inventors

Nishant Shukla · Hermosa Beach, US
Ashwin Dharne · Irvine, US

Key dates

Filing date	Feb 15, 2019
Grant date	May 25, 2021
Priority date	—
Expiry date	Feb 15, 2039

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/223
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

The present teaching relates to method, system, medium, and implementations for speech recognition. An audio signal is received that represents a speech of a user engaged in a dialogue. A visual signal is received that captures the user uttering the speech. A first speech recognition result is obtained by performing audio based speech recognition based on the audio signal. Based on the visual signal, lip movement of the user is detected and a second speech recognition result is obtained by performing lip reading based speech recognition. The first and the second speech recognition results are then integrated to generate an integrated speech recognition result.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.