Speech conversion method and apparatus, storage medium, and electronic device
US12223973B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Aug 9, 2024 |
| Grant date | Feb 11, 2025 |
| Priority date | — |
| Expiry date | Aug 9, 2044 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2021/0135
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Embodiments of the present application provide a speech conversion method and apparatus, a storage medium, and an electronic device. The method includes: acquiring a source speech to be converted and a target speech sample of a target speaker; recognizing a style category of the target speech sample, and extracting a target audio feature from the target speech sample according to the style category; extracting a source audio feature from the source speech; acquiring a first style feature of the target speech sample and determining a second style feature of the target speech sample according to the first style feature; fusing and mapping the source audio feature, the target audio feature, and the second style feature to obtain a joint encoding feature; and decoding the joint encoding feature, to obtain a target speech feature, and converting the source speech based on the target speech feature to obtain a target speech.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.