Patent · US Active

Unit-selection text-to-speech synthesis based on predicted concatenation parameters

US9934775B2 · kind B2 · utility

25Cited by

2,115References

28Claims

0Family size

Assignee

Apple Inc. · US

Inventors

Tuomo J. Raitio · Sunnyvale, US
Kishore Sunkeswari Prahallad · Cupertino, US
Alistair D. Conkie · San Jose, US
Ladan Golipour · Morristown, US
David A. Winarsky · Austin, US

Key dates

Filing date	Sep 15, 2016
Grant date	Apr 3, 2018
Priority date	—
Expiry date	Sep 15, 2036

Classification

Technology area (CPC G)Physics
CPC primaryG10L13/07
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Systems and processes for performing unit-selection text-to-speech synthesis are provided. In an example process, text to be converted to speech is received. The text is represented as a sequence of target units. A plurality of candidate speech segments corresponding to the sequence of target units are selected. Predicted statistical parameters of acoustic features associated with the sequence of target units are determined. The predicted statistical parameters of acoustic features are used to determine target costs and concatenation costs associated with the plurality of candidate speech segments. Based on a combined cost determined from the target costs and concatenation costs, a subset of candidate speech segments is selected from the plurality of candidate speech segments. Speech corresponding to the received text is generated using the subset of candidate speech segments.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.