Patent · US Expired

Method of speech segment selection for concatenative synthesis based on prosody-aligned distance measure

US7315813B2 · kind B2 · utility

8Cited by
2References
11Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 29, 2002
Grant dateJan 1, 2008
Priority date
Expiry dateNov 29, 2024

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L13/04
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method of speech segment selection for concatenative synthesis based on prosody-aligned distance measure is disclosed. This method is based on comparison of speech segments segmented from a speech corpus, wherein speech segments are fully prosody-aligned to each other before distortion measure. With prosody alignment embedded in selection process, distortion resulting from possible prosody modification in synthesis could be taken into account objectively in selection phase. In order to carry out the purpose of the present invention, automatic segmentation, pitch marking and PSOLA method work together for prosody alignment. Two distortion measures, MFCC and PSQM are used for comparing two prosody-aligned segments of speech because of human perceptual consideration.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.