Client/server architecture for text-to-speech synthesis
US6810379B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Apr 24, 2001 |
| Grant date | Oct 26, 2004 |
| Priority date | — |
| Expiry date | Sep 7, 2022 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L13/07
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A client/server text-to-speech synthesis system and method divides the synthesis work optimally between client and server. The server stores the large databases used for pronunciation analysis, prosody generation, and acoustic unit selection for a normalized text, while the client performs the computationally intensive decompression and concatenation of the selected acoustic units to generate speech. The units are transmitted from the server to the client in a highly compressed format, with the compression method selected based on the predetermined set of potential acoustic units. This compression method allows very high-quality, natural-sounding speech to be output at the client machine.
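The division of labor described in the abstract can be illustrated with a minimal sketch. Everything named here is hypothetical: the `UNIT_DB` inventory, the unit names, and both function names are invented for illustration, and `zlib` stands in for the patent's compression method, which the abstract does not specify. Text normalization, pronunciation analysis, prosody generation, and the actual unit selection are stubbed out; the server simply receives a list of unit names that has already been chosen.

```python
import struct
import zlib

# Hypothetical unit inventory held only on the server:
# unit name -> 16-bit PCM samples (toy values, not real speech).
UNIT_DB = {
    "h-e": [0, 120, 240, 120],
    "e-l": [0, -80, -160, -80],
    "l-o": [0, 60, 120, 60],
}


def server_select_units(unit_names):
    """Server side: look up each selected unit and return compressed payloads.

    Assumption: unit selection has already produced `unit_names`.
    zlib is a placeholder codec, not the method claimed in the patent.
    """
    payloads = []
    for name in unit_names:
        samples = UNIT_DB[name]
        # Pack the samples as little-endian signed 16-bit integers.
        raw = struct.pack(f"<{len(samples)}h", *samples)
        payloads.append(zlib.compress(raw))
    return payloads


def client_synthesize(payloads):
    """Client side: decompress each unit and concatenate into one waveform."""
    waveform = []
    for payload in payloads:
        raw = zlib.decompress(payload)
        # Two bytes per sample, so len(raw) // 2 samples per unit.
        waveform.extend(struct.unpack(f"<{len(raw) // 2}h", raw))
    return waveform


if __name__ == "__main__":
    compressed = server_select_units(["h-e", "e-l", "l-o"])
    print(client_synthesize(compressed))
```

In this sketch the client never sees the unit database, only the compressed payloads, which mirrors the split in the abstract: large data and selection stay on the server, while decompression and concatenation run on the client.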
Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.