Patent · US Active

Method for synthesized speech generation using emotion information correction and apparatus

US11636845B2 · kind B2 · utility

0Cited by
1References
13Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 14, 2020
Grant dateApr 25, 2023
Priority date
Expiry dateApr 8, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L13/04
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method includes generating first synthesized speech by using text and a first emotion vector configured for the text, extracting a second emotion vector included in the first synthesized speech, determining whether correction of the second emotion information vector is needed by comparing a loss value calculated by using the first emotion information vector and the second emotion information vector with a preconfigured threshold, re-performing speech synthesis by using a third emotion information vector generated by correcting the second emotion information vector, and outputting the generated synthesized speech, thereby configuring emotion information of speech in a more effective manner. A speech synthesis apparatus may be associated with an artificial intelligence module, drone (unmanned aerial vehicle, UAV), robot, augmented reality (AR) devices, virtual reality (VR) devices, devices related to 5G services, and the like.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.