Patent · US Active

Learning singing from speech

US12308019B2 · kind B2 · utility

Cited by	0

References	5

Claims	16

Family size	0

Assignee

TENCENT AMERICA LLC · US

Inventors

Chengzhu Yu · Bellevue, US
Heng Lu · Sammamish, US
Chao Weng · Fremont, US
Dong Yu · Bellevue, US

Key dates

Filing date	Jul 11, 2022
Grant date	May 20, 2025
Priority date	—
Expiry date	Aug 9, 2043

Classification

Technology area (CPC G)	Physics
CPC primary	G10L2015/025
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

A method, computer program, and computer system are provided for converting a singing voice of a first person associated with a first speaker to a singing voice of a second person using a speaking voice of the second person associated with a second speaker. A context associated with one or more phonemes corresponding to the singing voice of the first person is encoded, and the one or more phonemes are aligned to one or more target acoustic frames based on the encoded context. One or more mel-spectrogram features are recursively generated from the aligned phonemes, the target acoustic frames, and a sample of the speaking voice of the second person. A sample corresponding to the singing voice of the first person is converted to a sample corresponding to the singing voice of the second person using the generated mel-spectrogram features.
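The data flow described in the abstract (encoded phoneme context, alignment to target frames, recursive feature generation conditioned on a speaker sample) can be illustrated with a toy sketch. Everything here is an assumption made for illustration and is not taken from the patent: the function names `align_to_frames` and `generate_mel`, the use of per-phoneme frame counts as the alignment mechanism, the fixed `decay` factor, and the simple additive update standing in for a learned decoder.

```python
from typing import List, Sequence

Vector = List[float]


def align_to_frames(encoded: Sequence[Vector], durations: Sequence[int]) -> List[Vector]:
    """Repeat each phoneme encoding for its frame count.

    Hypothetical duration-based alignment; the abstract only says that
    phonemes are aligned to target acoustic frames.
    """
    if len(encoded) != len(durations):
        raise ValueError("one duration per phoneme is required")
    frames: List[Vector] = []
    for vec, n_frames in zip(encoded, durations):
        frames.extend(list(vec) for _ in range(n_frames))
    return frames


def generate_mel(aligned: Sequence[Vector], speaker: Vector, decay: float = 0.5) -> List[Vector]:
    """Toy recursive generator standing in for a learned decoder.

    Each output frame depends on the previous output frame, the aligned
    phoneme encoding for that frame, and a fixed speaker vector.
    """
    prev = [0.0] * len(speaker)
    mel: List[Vector] = []
    for ctx in aligned:
        if len(ctx) != len(speaker):
            raise ValueError("context and speaker vectors must have equal length")
        frame = [decay * p + c + s for p, c, s in zip(prev, ctx, speaker)]
        mel.append(frame)
        prev = frame  # feed the output back in: this is the recursion
    return mel


# Two phonemes lasting 2 frames and 1 frame, and a speaker vector that would,
# in a real system, be derived from the second person's speech sample.
aligned = align_to_frames([[1.0, 0.0], [0.0, 1.0]], durations=[2, 1])
mel = generate_mel(aligned, speaker=[0.5, 0.5])
```

In this sketch the speaker vector is the only place the second person's identity enters, which mirrors the abstract's point that the target singing voice is conditioned on a speaking-voice sample rather than on singing data from that person.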

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.