Patent · US Active

Method for synthesized speech generation using emotion information correction and apparatus

US11636845B2 · kind B2 · utility

0Cited by

1References

13Claims

0Family size

Assignee

LG ELECTRONICS INC. · KR

Inventors

Siyoung YANG · Seoul, KR
Yongchul PARK · Seoul, KR
Sungmin HAN · Seoul, KR
Sangki Kim · Yongin-si, KR
Juyeong JANG · Seoul, KR
Minook Kim · Seoul, KR

Key dates

Filing date	Jul 14, 2020
Grant date	Apr 25, 2023
Priority date	—
Expiry date	Apr 8, 2041

Classification

Technology area (CPC G)Physics
CPC primaryG10L13/04
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A method includes generating first synthesized speech by using text and a first emotion vector configured for the text, extracting a second emotion vector included in the first synthesized speech, determining whether correction of the second emotion information vector is needed by comparing a loss value calculated by using the first emotion information vector and the second emotion information vector with a preconfigured threshold, re-performing speech synthesis by using a third emotion information vector generated by correcting the second emotion information vector, and outputting the generated synthesized speech, thereby configuring emotion information of speech in a more effective manner. A speech synthesis apparatus may be associated with an artificial intelligence module, drone (unmanned aerial vehicle, UAV), robot, augmented reality (AR) devices, virtual reality (VR) devices, devices related to 5G services, and the like.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.