Patent · US Active

Methods and apparatus for predicting prosody in speech synthesis

US9286886B2 · kind B2 · utility

6Cited by
12References
60Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJan 24, 2011
Grant dateMar 15, 2016
Priority date
Expiry dateDec 24, 2033

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L13/08
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Techniques for predicting prosody in speech synthesis may make use of a data set of example text fragments with corresponding aligned spoken audio. To predict prosody for synthesizing an input text, the input text may be compared with the data set of example text fragments to select a best matching sequence of one or more example text fragments, each example text fragment in the sequence being paired with a portion of the input text. The selected example text fragment sequence may be aligned with the input text, e.g., at the word level, such that prosody may be extracted from the audio aligned with the example text fragments, and the extracted prosody may be applied to the synthesis of the input text using the alignment between the input text and the example text fragments.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.