Patent · US Active

Text-to-speech from media content item snippets

US11710474B2 · kind B2 · utility

Cited by	0
References	9
Claims	20
Family size	0

Assignee

Spotify AB · SE

Inventors

Rohit Kumar · Bengaluru, IN
Henrik Lindström · Åkersberga, SE
Henriette Cramer · San Francisco, US
Sarah Mennicken · San Francisco, US
Sravana Reddy · Cambridge, US
Jennifer Thom-Santelli · Medford, US

Key dates

Filing date	Jan 12, 2021
Grant date	Jul 25, 2023
Priority date	—
Expiry date	Oct 11, 2041

Classification

Technology area (CPC G)	Physics
CPC primary	G10L13/04
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.
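The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names Track, AlignedWord, find_snippet and plan_output are hypothetical, the forced alignment data is assumed to be a list of per-word start and end times, the lyric match is a simple exact word-sequence comparison, and the result is only a playback plan (which spans to synthesize and which audio span to cut from a track) rather than actual audio.

```python
import string
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class AlignedWord:
    """One lyric word with its timing (assumed forced-alignment output)."""
    word: str
    start: float  # seconds from the start of the track
    end: float


@dataclass
class Track:
    track_id: str
    words: List[AlignedWord]


def normalize(word: str) -> str:
    """Lowercase a word and drop surrounding punctuation for matching."""
    return word.strip(string.punctuation).lower()


def find_snippet(phrase: List[str], track: Track) -> Optional[Tuple[float, float]]:
    """Return (start, end) of the first place the phrase occurs in the lyrics."""
    lyric = [normalize(w.word) for w in track.words]
    n = len(phrase)
    for i in range(len(lyric) - n + 1):
        if lyric[i:i + n] == phrase:
            return track.words[i].start, track.words[i + n - 1].end
    return None


def plan_output(text: str, catalog: List[Track], min_words: int = 3) -> list:
    """Partition the text into word runs (the "text sets"), longest first,
    and replace the first run that matches a lyric with a track snippet.
    Everything else is left for the speech synthesizer."""
    raw = text.split()
    norm = [normalize(w) for w in raw]
    for n in range(len(raw), min_words - 1, -1):
        for i in range(len(raw) - n + 1):
            for track in catalog:
                span = find_snippet(norm[i:i + n], track)
                if span is None:
                    continue
                plan = []
                if i > 0:
                    plan.append(("tts", " ".join(raw[:i])))
                plan.append(("snippet", track.track_id, span[0], span[1]))
                if i + n < len(raw):
                    plan.append(("tts", " ".join(raw[i + n:])))
                return plan
    # No lyric match: synthesize the whole input.
    return [("tts", text)]
```

Given a catalog entry whose aligned lyrics contain "the sun will rise again", calling plan_output("Good morning, the sun will rise again today", catalog) would yield a synthesized segment for "Good morning,", a snippet segment with the matched start and end times, and a synthesized segment for "today". Trying the longest runs first is a design choice made for this sketch; the abstract does not say how candidate text sets are ordered.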

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.