Voice generation with predetermined emotion type
US10803850B2 · kind B2 · utility
Inventors
Key dates
| Filing date | Sep 8, 2014 |
| Grant date | Oct 13, 2020 |
| Priority date | — |
| Expiry date | Sep 21, 2035 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L25/63
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Techniques for generating voice with a predetermined emotion type are described. In an aspect, semantic content and emotion type are separately specified for a speech segment to be generated. A candidate generation module generates a plurality of emotionally diverse candidate speech segments, wherein each candidate has the specified semantic content. A candidate selection module identifies an optimal candidate from among the plurality of candidate speech segments, wherein the optimal candidate most closely corresponds to the predetermined emotion type. In further aspects, crowd-sourcing techniques may be applied to generate the plurality of speech output candidates associated with a given semantic content, and machine-learning techniques may be applied to derive parameters for a real-time algorithm for the candidate selection module.
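The two-stage flow in the abstract (generate emotionally diverse candidates for one semantic content, then pick the candidate closest to the requested emotion type) can be illustrated with a minimal Python sketch. Everything named here is an assumption for illustration and not taken from the patent: the `Candidate` class, the emotion labels, the hard-coded lookup table standing in for crowd-sourced candidates, and the per-emotion scores standing in for parameters that machine learning might derive.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Candidate:
    """Hypothetical candidate speech segment (structure is an assumption)."""
    text: str                         # wording that carries the semantic content
    emotion_scores: Dict[str, float]  # assumed per-emotion scores in [0, 1]


# Hypothetical stand-in for crowd-sourced candidates: every entry under a key
# expresses the same semantic content with a different emotional colouring.
_CANDIDATE_TABLE: Dict[str, List[Candidate]] = {
    "greeting": [
        Candidate("Hey, great to see you!", {"happy": 0.9, "sad": 0.0, "neutral": 0.1}),
        Candidate("Hello.", {"happy": 0.2, "sad": 0.1, "neutral": 0.7}),
        Candidate("Oh. Hi.", {"happy": 0.05, "sad": 0.7, "neutral": 0.25}),
    ],
}


def generate_candidates(semantic_content: str) -> List[Candidate]:
    """Candidate generation step: return all candidates for the given content."""
    return list(_CANDIDATE_TABLE.get(semantic_content, []))


def select_candidate(candidates: List[Candidate], emotion_type: str) -> Candidate:
    """Candidate selection step: pick the highest score for the requested emotion.

    A plain argmax over assumed scores; a real selector would be learned.
    """
    if not candidates:
        raise ValueError("no candidates available for selection")
    return max(candidates, key=lambda c: c.emotion_scores.get(emotion_type, 0.0))


if __name__ == "__main__":
    # Semantic content and emotion type are specified separately.
    best = select_candidate(generate_candidates("greeting"), "sad")
    print(best.text)
```

Keeping generation and selection as separate functions mirrors the separation of semantic content from emotion type: the same candidate pool can be reused for any requested emotion, and only the selection call changes.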
Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.