Method for synthesized speech generation using emotion information correction and apparatus
US11636845B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 14, 2020 |
| Grant date | Apr 25, 2023 |
| Priority date | — |
| Expiry date | Apr 8, 2041 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L13/04
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method includes generating first synthesized speech by using text and a first emotion information vector configured for the text, extracting a second emotion information vector included in the first synthesized speech, determining whether correction of the second emotion information vector is needed by comparing a loss value calculated by using the first emotion information vector and the second emotion information vector with a preconfigured threshold, re-performing speech synthesis by using a third emotion information vector generated by correcting the second emotion information vector, and outputting the generated synthesized speech, thereby configuring emotion information of speech in a more effective manner. A speech synthesis apparatus may be associated with an artificial intelligence module, a drone (unmanned aerial vehicle, UAV), a robot, augmented reality (AR) devices, virtual reality (VR) devices, devices related to 5G services, and the like.
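The control flow described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the patented implementation: the abstract does not specify the loss function, the direction of the threshold comparison, or the correction rule, so the mean squared error, the "correct when loss exceeds threshold" test, the `gain` parameter, and every function name here are assumptions. The synthesizer and the emotion extractor are passed in as callables, and the toy stand-ins at the bottom exist only to make the sketch runnable.

```python
from typing import Any, Callable, List, Sequence

Vector = Sequence[float]


def mse_loss(first: Vector, second: Vector) -> float:
    """Assumed loss: mean squared error between two emotion vectors."""
    if len(first) != len(second):
        raise ValueError("emotion vectors must have the same length")
    return sum((f - s) ** 2 for f, s in zip(first, second)) / len(first)


def correct_vector(first: Vector, second: Vector, gain: float = 2.0) -> List[float]:
    """Hypothetical correction rule: move the second (extracted) vector
    toward the first (configured) one, scaled by `gain`. A gain above 1
    overshoots the target to compensate for attenuation in synthesis."""
    return [s + gain * (f - s) for f, s in zip(first, second)]


def synthesize_with_correction(
    text: str,
    first: Vector,
    synthesize: Callable[[str, Vector], Any],
    extract_emotion: Callable[[Any], Vector],
    threshold: float,
    gain: float = 2.0,
) -> Any:
    """Single correction pass, following the order of steps in the abstract."""
    speech = synthesize(text, first)        # first synthesized speech
    second = extract_emotion(speech)        # second emotion information vector
    # Assumption: correction is needed when the loss exceeds the threshold.
    if mse_loss(first, second) <= threshold:
        return speech
    third = correct_vector(first, second, gain)  # third emotion information vector
    return synthesize(text, third)          # re-performed synthesis


# Toy stand-ins (not real models): synthesis halves the emotion intensity.
def toy_synthesize(text: str, emotion: Vector) -> dict:
    return {"text": text, "emotion": [0.5 * e for e in emotion]}


def toy_extract(speech: dict) -> List[float]:
    return speech["emotion"]


if __name__ == "__main__":
    out = synthesize_with_correction(
        "hello", [1.0, 0.0], toy_synthesize, toy_extract, threshold=0.01
    )
    print(out["emotion"])
```

With the toy stand-ins, the first pass conveys only half of the configured emotion intensity, so the loss exceeds the threshold and the second pass is conditioned on an overshooting vector, bringing the output closer to the configured emotion. A real system would replace both callables with trained models and would likely use a different correction rule.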
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.