Patent · US Expired

Method and apparatus for synthesizing realistic animations of a human speaking using a computer

US6097381A · kind A · utility

26Cited by
5References
12Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 8, 1998
Grant dateAug 1, 2000
Priority date
Expiry dateMay 8, 2018

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2021/105
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method and apparatus for synthesizing speech or facial movements to match selected speech sequences. A videotape of an arbitrary text sequence is obtained including a plurality of images of a user speaking various sequences. Video images corresponding to specific spoken phonemes are obtained. A video frame is digitized from that sequence which represents the extreme of mouth motion and shape. This is used to create a database of images of different facial positions relative to spoken phonemes and diphthongs. An audio speech sequence is then used as the element to which a video sequence will be matched. The audio sequence is analyzed to determine spoken phoneme sequences and relative timings. The database is used to obtain images for each of these phonemes and these times, and morphing techniques are used to create transitions between the images. Different parts of the images can be processed in different ways to make a more realistic speech pattern.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.