Patent · US Expired

Neural network acoustic and visual speech recognition system

US5586215A · kind A · utility

39Cited by
5References
16Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 26, 1992
Grant dateDec 17, 1996
Priority date
Expiry dateMay 26, 2012

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L25/18
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The apparatus for the recognition of speech comprises an acoustic preprocessor, a visual preprocessor, and a speech classifier that operates on the acoustic and visual preprocessed data. The acoustic preprocessor comprises a log mel spectrum analyzer that produces an equal mel bandwidth log power spectrum. The visual processor detects the motion of a set of fiducial markers on the speaker's face and extracts a set of normalized distance vectors describing lip and mouth movement. The speech classifier uses a multilevel time-delay neural network operating on the preprocessed acoustic and visual data to form an output probability distribution that indicates the probability of each candidate utterance having been spoken, based on the acoustic and visual data.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.