Patent · US Active

Speech conversion method and apparatus, storage medium, and electronic device

US12223973B1 · kind B1 · utility

0Cited by
4References
7Claims
0Family size

Assignee

Inventors

Key dates

Filing dateAug 9, 2024
Grant dateFeb 11, 2025
Priority date
Expiry dateAug 9, 2044

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2021/0135
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Embodiments of the present application provide a speech conversion method and apparatus, a storage medium, and an electronic device. The method includes: acquiring a source speech to be converted and a target speech sample of a target speaker; recognizing a style category of the target speech sample, and extracting a target audio feature from the target speech sample according to the style category; extracting a source audio feature from the source speech; acquiring a first style feature of the target speech sample and determining a second style feature of the target speech sample according to the first style feature; fusing and mapping the source audio feature, the target audio feature, and the second style feature to obtain a joint encoding feature; and decoding the joint encoding feature, to obtain a target speech feature, and converting the source speech based on the target speech feature to obtain a target speech.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.