Patent · US Expired

Employing speech models in concatenative speech synthesis

US6950798B1 · kind B1 · utility

26Cited by

8References

41Claims

0Family size

Assignee

AT&T CORP. · US

Inventors

Mark Charles Beutnagel · Mendham, US
David A. Kapilow · Berkeley Heights, US
Ioannis Stylianou · Madison, US
Ann K. Syrdal · Morristown, US

Key dates

Filing date	Mar 2, 2002
Grant date	Sep 27, 2005
Priority date	—
Expiry date	Jan 23, 2024

Classification

Technology area (CPC G)Physics
CPC primaryG10L13/07
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A text-to-speech synthesizer employs database that includes units. For each unit there is a collection of unit selection parameters and a plurality of frames. Each frame has a set of model parameters derived from a base speech frame, and a speech frame synthesized from the frame's model parameters. A text to be synthesized is converted to a sequence of desired unit features sets, and for each such set the database is perused to retrieve a best-matching unit. An assessment is made whether modifications to the frames are needed, because of discontinuities in the model parameters at unit boundaries, or because of differences between the desired and selected unit features. When modifications are necessary, the model parameters of frames that need to be altered are modified, and new frames are synthesized from the modified model parameters and concatenated to the output. Otherwise, the speech frames previously stored in the database are retrieved and concatenated to the output.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.