Patent · US Active

Voice generation with predetermined emotion type

US10803850B2 · kind B2 · utility

Cited by: 0
References: 5
Claims: 18
Family size: 0

Inventors

Key dates

Filing date: Sep 8, 2014
Grant date: Oct 13, 2020
Priority date
Expiry date: Sep 21, 2035

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L25/63
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Techniques for generating voice with a predetermined emotion type. In an aspect, semantic content and emotion type are separately specified for a speech segment to be generated. A candidate generation module generates a plurality of emotionally diverse candidate speech segments, wherein each candidate has the specified semantic content. A candidate selection module identifies an optimal candidate from among the plurality of candidate speech segments, wherein the optimal candidate most closely corresponds to the predetermined emotion type. In further aspects, crowd-sourcing techniques may be applied to generate the plurality of speech output candidates associated with a given semantic content, and machine-learning techniques may be applied to derive parameters for a real-time algorithm for the candidate selection module.
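
The two-stage flow in the abstract (generate emotionally diverse candidates for one semantic content, then select the candidate closest to the requested emotion type) can be illustrated with a minimal Python sketch. Everything named here is an assumption for illustration and is not taken from the patent: the `Candidate` structure, the feature names, the `WEIGHTS` values, the example sentences, and the linear scoring rule are all hypothetical. A fixed lookup table stands in for crowd-sourced candidates, and hand-picked weights stand in for parameters that would be learned offline.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Candidate:
    """One candidate speech segment (hypothetical structure)."""
    text: str                   # wording that conveys the semantic content
    features: Dict[str, float]  # illustrative features, not from the patent


# Hypothetical per-emotion weights. The abstract says such parameters may be
# derived with machine learning; these values are made up for the example.
WEIGHTS: Dict[str, Dict[str, float]] = {
    "happy": {"exclaim": 1.0, "positive_words": 2.0, "negative_words": -2.0},
    "sad": {"exclaim": -1.0, "positive_words": -1.0, "negative_words": 2.0},
}

# Stand-in for crowd-sourced candidates: several wordings per semantic content.
CANDIDATE_BANK: Dict[str, List[Candidate]] = {
    "team_lost": [
        Candidate("Oh no, they lost the game.",
                  {"exclaim": 0.0, "positive_words": 0.0, "negative_words": 2.0}),
        Candidate("They lost, but what a great match!",
                  {"exclaim": 1.0, "positive_words": 1.0, "negative_words": 1.0}),
        Candidate("They lost the game.",
                  {"exclaim": 0.0, "positive_words": 0.0, "negative_words": 1.0}),
    ],
}


def generate_candidates(semantic_content: str) -> List[Candidate]:
    """Candidate generation: return all stored wordings for the content key."""
    return CANDIDATE_BANK[semantic_content]


def score(candidate: Candidate, emotion: str) -> float:
    """Linear score of how well a candidate matches the requested emotion."""
    weights = WEIGHTS[emotion]
    return sum(weights.get(name, 0.0) * value
               for name, value in candidate.features.items())


def select_candidate(candidates: List[Candidate], emotion: str) -> Candidate:
    """Candidate selection: pick the highest-scoring candidate."""
    return max(candidates, key=lambda c: score(c, emotion))


if __name__ == "__main__":
    options = generate_candidates("team_lost")
    for emotion in ("happy", "sad"):
        print(emotion, "->", select_candidate(options, emotion).text)
```

Semantic content and emotion type enter as separate arguments, mirroring the separation the abstract describes. A real system would presumably score acoustic or prosodic properties of synthesized speech rather than text features.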

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.