Patent · US Active

Generating a voice model for a user

US11430424B2 · kind B2 · utility

0Cited by

0References

20Claims

0Family size

Assignee

Meta Platforms Technologies, LLC · US

Inventors

Lior Wolf · Cambridge, US
David Vazquez · Madrid, ES
Tali Zvi · San Carlos, US
Yaniv Taigman · Haifa, IL
Adam Polyak · Nes Ziona, IL
Hyunbin Park · Palo Alto, US

Key dates

Filing date	Nov 13, 2019
Grant date	Aug 30, 2022
Priority date	—
Expiry date	Mar 4, 2040

Classification

Technology area (CPC G)Physics
CPC primaryG10L25/30
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Disclosed herein a system, a method and a device for generating a voice model for a user. A device can include an encoder and a decoder to generate a voice model for converting text to an audio output that resembles a voice of the person sending respective text. The encoder can includes a neural network and can receive a plurality of audio samples from a user. The encoder can generate a sequence of values and provide the sequence of values to the decoder. The decoder can establish, using the sequence of values and one or more speaker embeddings of the user, a voice model corresponding to the plurality of audio samples of the user.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.