Synthesizing speech from text
US8249874B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 25, 2008 |
| Grant date | Aug 21, 2012 |
| Priority date | — |
| Expiry date | Aug 29, 2030 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L13/10
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Speech is synthesized for a given text by determining a sequence of phonetic components based on the text, determining a sequence of target phonetic elements associated with the phonetic components, determining a sequence of target event types associated with the phonetic components, and determining a sequence of speech units from a plurality of stored speech unit candidates by use of a cost function. The cost function comprises a unit cost, a concatenation cost, and an event type cost for each speech unit in the sequence of speech units. The unit cost of a speech unit is determined with respect to the corresponding target phonetic element, the concatenation cost with respect to adjacent speech units, and the event type cost with respect to the corresponding target event type.
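The three-part cost function described in the abstract can be illustrated with a small sketch. The abstract does not say how the minimizing sequence is found, so the dynamic-programming (Viterbi-style) search below is an assumption borrowed from common unit-selection practice, not the patented method. The `Unit` fields (`pitch`, `edge`, `event`), the three cost functions, and the unweighted sum are all hypothetical stand-ins chosen only to make the example runnable.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Unit:
    """Hypothetical stored speech unit candidate (fields are illustrative)."""
    name: str
    pitch: float  # assumed feature compared against the target phonetic element
    edge: float   # assumed boundary feature used when joining adjacent units
    event: str    # assumed event type label carried by the unit


def unit_cost(unit: Unit, target_pitch: float) -> float:
    # Mismatch between a unit and its target phonetic element (assumed metric).
    return abs(unit.pitch - target_pitch)


def concat_cost(prev: Unit, unit: Unit) -> float:
    # Mismatch at the join between adjacent units (assumed metric).
    return abs(prev.edge - unit.edge)


def event_cost(unit: Unit, target_event: str) -> float:
    # Penalty when the unit's event type differs from the target event type.
    return 0.0 if unit.event == target_event else 1.0


def select_units(candidates, targets, events):
    """Return (best unit sequence, total cost) by dynamic programming.

    candidates[i] lists the stored units available for position i.
    """
    # scores[i][j] = (lowest cost of any path ending in candidates[i][j],
    #                 index of the predecessor on that path)
    scores = []
    for i, column in enumerate(candidates):
        row = []
        for unit in column:
            local = unit_cost(unit, targets[i]) + event_cost(unit, events[i])
            if i == 0:
                row.append((local, None))
            else:
                best_prev, best_k = min(
                    (scores[i - 1][k][0] + concat_cost(prev, unit), k)
                    for k, prev in enumerate(candidates[i - 1])
                )
                row.append((best_prev + local, best_k))
        scores.append(row)

    # Backtrack from the cheapest final unit.
    j = min(range(len(scores[-1])), key=lambda k: scores[-1][k][0])
    total = scores[-1][j][0]
    path = []
    for i in range(len(candidates) - 1, -1, -1):
        path.append(candidates[i][j])
        j = scores[i][j][1]
    path.reverse()
    return path, total


if __name__ == "__main__":
    candidates = [
        [Unit("a1", 100.0, 1.0, "stress"), Unit("a2", 120.0, 5.0, "none")],
        [Unit("b1", 110.0, 1.0, "none"), Unit("b2", 110.0, 5.0, "stress")],
    ]
    path, total = select_units(candidates, [100.0, 110.0], ["stress", "none"])
    print([u.name for u in path], total)
```

With this toy data the search picks `a1` followed by `b1`, since both match their targets and event types and join without a boundary mismatch. A practical system would presumably weight the three terms and use richer features; those details are not given in the abstract.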
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.