Method of speech segment selection for concatenative synthesis based on prosody-aligned distance measure
US7315813B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 29, 2002 |
| Grant date | Jan 1, 2008 |
| Priority date | — |
| Expiry date | Nov 29, 2024 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L13/04
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method of speech segment selection for concatenative synthesis based on a prosody-aligned distance measure is disclosed. The method compares speech segments cut from a speech corpus, with the segments fully prosody-aligned to each other before distortion is measured. Because prosody alignment is embedded in the selection process, the distortion that prosody modification may introduce at synthesis time can be taken into account objectively during the selection phase. To perform the prosody alignment, automatic segmentation, pitch marking, and the PSOLA (pitch-synchronous overlap-add) method work together. Two distortion measures, MFCC (mel-frequency cepstral coefficients) and PSQM (Perceptual Speech Quality Measure), are used to compare two prosody-aligned speech segments because both reflect human perception.
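The align-then-measure idea in the abstract can be illustrated with a minimal sketch. It is not the patented procedure. It assumes each segment is already a list of per-frame feature vectors (stand-ins for MFCC frames), replaces PSOLA with simple linear interpolation of frames to a common duration, and uses mean Euclidean distance in place of the MFCC and PSQM measures. The selection rule (pick the candidate with the lowest average distance to the others) and the names `resample_frames`, `aligned_distance`, and `select_segment` are hypothetical and not taken from the patent.

```python
import math


def resample_frames(frames, n_out):
    """Linearly interpolate a list of feature vectors to n_out frames.

    Assumption: a crude stand-in for PSOLA duration alignment.
    """
    n_in = len(frames)
    if n_out == 1 or n_in == 1:
        return [list(frames[0]) for _ in range(n_out)]
    out = []
    for i in range(n_out):
        pos = i * (n_in - 1) / (n_out - 1)  # fractional source index
        lo = int(math.floor(pos))
        hi = min(lo + 1, n_in - 1)
        w = pos - lo
        out.append([(1 - w) * a + w * b for a, b in zip(frames[lo], frames[hi])])
    return out


def aligned_distance(reference, candidate):
    """Mean per-frame Euclidean distance after aligning candidate to reference."""
    aligned = resample_frames(candidate, len(reference))
    total = sum(math.dist(r, c) for r, c in zip(reference, aligned))
    return total / len(reference)


def select_segment(candidates):
    """Return the index of the candidate with the lowest mean distance to the rest.

    Assumption: a hypothetical selection rule chosen for illustration.
    """
    best_idx, best_score = None, math.inf
    for i, ref in enumerate(candidates):
        dists = [aligned_distance(ref, c) for j, c in enumerate(candidates) if j != i]
        score = sum(dists) / len(dists) if dists else 0.0
        if score < best_score:
            best_idx, best_score = i, score
    return best_idx


if __name__ == "__main__":
    # Three toy one-dimensional "segments" of equal length.
    segments = [
        [[0.0], [1.0], [2.0]],
        [[1.0], [2.0], [3.0]],
        [[5.0], [6.0], [7.0]],
    ]
    print(select_segment(segments))
```

Aligning before measuring means a candidate is penalized for the mismatch that remains after its duration has been matched, not for the duration difference itself. A real system would align pitch as well as duration and would operate on waveforms rather than precomputed frames.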
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.