Patent · US · Expired

Method and apparatus for synthesizing realistic animations of a human speaking using a computer

US6232965A · kind A · utility

Cited by	31
References	6
Claims	68
Family size	0

Assignee

CALIFORNIA INSTITUTE OF TECHNOLOGY · US

Inventors

Kenneth C. Scott · Ottawa, US
Matthew C. Yeates · La Cañada Flintridge, US
David S. Kagels · Pasadena, US
Stephen Hilary Watson · Pasadena, US

Key dates

Filing date	Nov 30, 1994
Grant date	May 15, 2001
Priority date	—
Expiry date	Nov 30, 2014

Classification

Technology area (CPC G)	Physics
CPC primary	G10L2021/105
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

A method and apparatus for synthesizing speech or facial movements to match selected speech sequences. A videotape of an arbitrary text sequence is obtained including a plurality of images of a user speaking various sequences. Video images corresponding to specific spoken phonemes are obtained. A video frame is digitized from that sequence which represents the extreme of mouth motion and shape. This is used to create a database of images of different facial positions relative to spoken phonemes and diphthongs. An audio speech sequence is then used as the element to which a video sequence will be matched. The audio sequence is analyzed to determine spoken phoneme sequences and relative timings. The database is used to obtain images for each of these phonemes and these times, and morphing techniques are used to create transitions between the images. Different parts of the images can be processed in different ways to make a more realistic speech pattern.
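The pipeline the abstract describes (phoneme timings, database lookup, interpolated transitions) can be illustrated with a minimal sketch. Everything here is an assumption for illustration and not the patented implementation: the `VISEME_DB` table, the phoneme labels, the four-pixel "images", and the function names are hypothetical; the phoneme sequence and timings are assumed to have already been extracted from the audio; and a plain linear cross-dissolve stands in for the morphing step.

```python
from bisect import bisect_right

# Hypothetical database: phoneme label -> keyframe image showing the
# extreme mouth position. Real keyframes would be digitized video frames;
# here each is a tiny grayscale image flattened to a list of floats.
VISEME_DB = {
    "sil": [0.0, 0.0, 0.0, 0.0],
    "m":   [0.1, 0.1, 0.1, 0.1],
    "aa":  [1.0, 0.8, 0.8, 1.0],
    "iy":  [0.4, 0.6, 0.6, 0.4],
}


def blend(img_a, img_b, t):
    """Linear cross-dissolve between two images (a stand-in for morphing)."""
    return [(1.0 - t) * a + t * b for a, b in zip(img_a, img_b)]


def synthesize(timed_phonemes, fps=30.0):
    """Build a frame sequence from (phoneme, time_in_seconds) pairs.

    Each time marks when the mouth reaches that phoneme's keyframe;
    frames between two keyframes are interpolated.
    """
    if not timed_phonemes:
        return []
    times = [t for _, t in timed_phonemes]
    keys = [VISEME_DB[p] for p, _ in timed_phonemes]
    if len(keys) == 1:
        return [list(keys[0])]

    n_frames = int(round((times[-1] - times[0]) * fps)) + 1
    frames = []
    for i in range(n_frames):
        now = times[0] + i / fps
        # Index of the last keyframe at or before `now`, clamped so that
        # keys[k + 1] always exists.
        k = max(0, min(bisect_right(times, now) - 1, len(times) - 2))
        span = times[k + 1] - times[k]
        t = (now - times[k]) / span if span > 0 else 1.0
        t = min(max(t, 0.0), 1.0)
        frames.append(blend(keys[k], keys[k + 1], t))
    return frames
```

For example, `synthesize([("sil", 0.0), ("aa", 1.0)], fps=2.0)` yields three frames: the closed-mouth keyframe, a halfway blend, and the open-mouth keyframe. Processing different parts of the image in different ways, as the abstract mentions, would amount to applying a separate blend per image region; that refinement is omitted from this sketch.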

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.