Patent · US Active

Emotion-based text to speech

US12142257B2 · kind B2 · utility

2Cited by

178References

20Claims

0Family size

Assignee

SNAP INC. · US

Inventors

Liron Harazi · Elad, IL
Jacob Assa · New York, US
Alan Bekker · Givat Shmuel, IL

Key dates

Filing date	Feb 8, 2022
Grant date	Nov 12, 2024
Priority date	—
Expiry date	Oct 3, 2042

Classification

Technology area (CPC G)Physics
CPC primaryG10L25/63
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Systems and methods are provided for providing emotion-based text to speech. The systems and methods perform operations comprising accessing a text string; storing a plurality of embeddings associated with a plurality of speakers, a first embedding for a first speaker being associated with a first emotion and a second embedding for a second speaker of the plurality of speakers being associated with a second emotion; selecting the first speaker to speak one or more words of the text string; determining that the one or more words are associated with the second emotion; generating, based on the first embedding and the second embedding, a third embedding for the first speaker associated with the second emotion; and applying the third embedding and the text string to a vocoder to generate an audio stream comprising the one or more words being spoken by the first speaker with the second emotion.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.