Patent · US Active

System and method for distributed voice models across cloud and device for embedded text-to-speech

US9218804B2 · kind B2 · utility

4Cited by

11References

20Claims

0Family size

Assignee

AT&T Intellectual Property I, L.P. · US

Inventors

Benjamin J. Stern · Morristown, US
Mark Charles Beutnagel · Mendham, US
Alistair D. Conkie · San Jose, US
Horst J. Schroeter · New Providence, US
Amanda Stent · Chatham, US

Key dates

Filing date	Sep 12, 2013
Grant date	Dec 22, 2015
Priority date	—
Expiry date	Jun 19, 2034

Classification

Technology area (CPC G)Physics
CPC primaryG10L13/07
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Systems, methods, and computer-readable storage media for intelligent caching of concatenative speech units for use in speech synthesis. A system configured to practice the method can identify a speech synthesis context, and determine, based on a local cache of text-to-speech units for a text-to-speech voice and based on the speech synthesis context, additional text-to-speech units which are not in the local cache. The system can request from a server the additional text-to-speech units, and store the additional text-to-speech units in the local cache. The system can then synthesize speech using the text-to-speech units and the additional text-to-speech units in the local cache. The system can prune the cache as the context changes, based on availability of local storage, or after synthesizing the speech. The local cache can store a core set of text-to-speech units associated with the text-to-speech voice that cannot be pruned from the local cache.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.