Patent · US Active

Voice generation with predetermined emotion type

US10803850B2 · kind B2 · utility

Cited by	0
References	5
Claims	18
Family size	0

Inventors

Chi-Ho Li · Beijing, CN
Baoxun Wang · Beijing, CN
Max Leung · Beijing, CN

Key dates

Filing date	Sep 8, 2014
Grant date	Oct 13, 2020
Priority date	—
Expiry date	Sep 21, 2035

Classification

Technology area (CPC G)	Physics
CPC primary	G10L25/63
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Techniques for generating voice with predetermined emotion type. In an aspect, semantic content and emotion type are separately specified for a speech segment to be generated. A candidate generation module generates a plurality of emotionally diverse candidate speech segments, wherein each candidate has the specified semantic content. A candidate selection module identifies an optimal candidate from amongst the plurality of candidate speech segments, wherein the optimal candidate most closely corresponds to the predetermined emotion type. In further aspects, crowd-sourcing techniques may be applied to generate the plurality of speech output candidates associated with a given semantic content, and machine-learning techniques may be applied to derive parameters for a real-time algorithm for the candidate selection module.
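
The two-stage flow in the abstract (generate emotionally diverse candidates for one semantic content, then select the candidate closest to the requested emotion type) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the patent: the `Candidate` class, the `generate_candidates` and `select_candidate` functions, the hard-coded renderings, and the per-emotion scores. In particular, the fixed list stands in for crowd-sourced candidates, and the simple highest-score rule stands in for a selection algorithm whose parameters would be derived by machine learning.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """One candidate speech segment (hypothetical representation)."""
    semantic_content: str   # identical for every candidate
    rendering: str          # how the content is voiced; differs per candidate
    emotion_scores: dict    # assumed per-emotion scores in [0, 1]


def generate_candidates(semantic_content):
    """Stand-in for the candidate generation module.

    Returns several emotionally diverse candidates that all carry the
    same semantic content. The renderings and scores are made up.
    """
    renderings = [
        ("flat pitch, slow tempo", {"happy": 0.1, "sad": 0.7, "neutral": 0.5}),
        ("rising pitch, fast tempo", {"happy": 0.8, "sad": 0.1, "neutral": 0.3}),
        ("level pitch, medium tempo", {"happy": 0.3, "sad": 0.2, "neutral": 0.9}),
    ]
    return [Candidate(semantic_content, r, s) for r, s in renderings]


def select_candidate(candidates, emotion_type):
    """Stand-in for the candidate selection module.

    Picks the candidate with the highest score for the requested emotion
    type; an emotion missing from a candidate's scores counts as 0.0.
    """
    if not candidates:
        raise ValueError("no candidates to select from")
    return max(candidates, key=lambda c: c.emotion_scores.get(emotion_type, 0.0))


# Semantic content and emotion type are specified separately.
candidates = generate_candidates("It is raining today.")
best = select_candidate(candidates, "happy")
print(best.rendering)
```

Keeping generation and selection as separate functions mirrors the separation of modules described in the abstract: the candidate pool can be built ahead of time, while selection remains a cheap lookup suitable for real-time use.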

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.