Patent · US Active

Speaker conversion for video games

US11605388B1 · kind B1 · utility

3Cited by
2References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 9, 2020
Grant dateMar 14, 2023
Priority date
Expiry dateMay 18, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2021/0135
  • WIPO fieldFurniture, games
  • WIPO sectorOther fields

Abstract

This specification describes a computer-implemented method of generating speech audio for use in a video game, wherein the speech audio is generated using a voice convertor that has been trained to convert audio data for a source speaker into audio data for a target speaker. The method comprises receiving: (i) source speech audio, and (ii) a target speaker identifier. The source speech audio comprises speech content in the voice of a source speaker. Source acoustic features are determined for the source speech audio. A target speaker embedding associated with the target speaker identifier is generated as output of a speaker encoder of the voice convertor. The target speaker embedding and the source acoustic features are inputted into an acoustic feature encoder of the voice convertor. One or more acoustic feature encodings are generated as output of the acoustic feature encoder. The one or more acoustic feature encodings are derived from the target speaker embedding and the source acoustic features. Target speech audio is generated for the target speaker. The target speech audio comprises the speech content in the voice of the target speaker. The generating comprises decoding the o…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.