Patent · US Expired

Method of speech segment selection for concatenative synthesis based on prosody-aligned distance measure

US7315813B2 · kind B2 · utility

8Cited by

2References

11Claims

0Family size

Assignee

Industrial Technology Research Institute · TW

Inventors

Chih-Chung Kuo · Tainan, TW
Chi-Shiang Kuo · Luodong, TW

Key dates

Filing date	Jul 29, 2002
Grant date	Jan 1, 2008
Priority date	—
Expiry date	Nov 29, 2024

Classification

Technology area (CPC G)Physics
CPC primaryG10L13/04
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A method of speech segment selection for concatenative synthesis based on prosody-aligned distance measure is disclosed. This method is based on comparison of speech segments segmented from a speech corpus, wherein speech segments are fully prosody-aligned to each other before distortion measure. With prosody alignment embedded in selection process, distortion resulting from possible prosody modification in synthesis could be taken into account objectively in selection phase. In order to carry out the purpose of the present invention, automatic segmentation, pitch marking and PSOLA method work together for prosody alignment. Two distortion measures, MFCC and PSQM are used for comparing two prosody-aligned segments of speech because of human perceptual consideration.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.