Method, device and system for using statistical information to reduce computation and memory requirements of a neural network based speech synthesis system
US5913194A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Jul 14, 1997 |
| Grant date | Jun 15, 1999 |
| Priority date | — |
| Expiry date | Jul 14, 2017 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L25/30
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method (400), device and system (300) provide, in response to linguistic information, efficient generation of a parametric representation of speech using a neural network. The method provides, in response to linguistic information, efficient generation of a refined parametric representation of speech, comprising the steps of: A) using a data selection module to retrieve representative parameter vectors for each segment description according to the phonetic segment type and the phonetic segment types included in adjacent segment descriptions; B) interpolating between the representative parameter vectors according to the segment descriptions and duration to provide interpolated statistical parameters; C) converting the interpolated statistical parameters and linguistic information to neural network input parameters; D) utilizing a statistically enhanced neural network/neural network with post-processor to provide neural network output parameters that correspond to a parametric representation of speech; and E) converting the neural network output parameters to a refined parametric representation of speech.
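Steps A and B of the abstract can be illustrated with a minimal sketch. The abstract does not specify the lookup structure or the interpolation scheme, so everything here is an assumption made for illustration: the table contents, the fallback to a context-independent vector, the `sil` boundary label, the names `select_vector` and `interpolate`, and the choice of linear interpolation between segment centres.

```python
from typing import Dict, List, Tuple

Vector = List[float]
Context = Tuple[str, str, str]  # (previous segment type, segment type, next segment type)

# Hypothetical table of representative parameter vectors (made-up values).
REPRESENTATIVE: Dict[Context, Vector] = {
    ("sil", "a", "t"): [1.0, 0.0],
    ("a", "t", "sil"): [0.0, 2.0],
}
# Hypothetical context-independent fallback, used when a context is missing.
FALLBACK: Dict[str, Vector] = {"a": [0.5, 0.5], "t": [0.2, 1.0]}


def select_vector(prev: str, phone: str, nxt: str) -> Vector:
    """Step A (sketch): look up by segment type and adjacent segment types."""
    return REPRESENTATIVE.get((prev, phone, nxt), FALLBACK[phone])


def interpolate(segments: List[Tuple[str, int]]) -> List[Vector]:
    """Step B (sketch): one interpolated vector per frame.

    segments is a list of (segment type, duration in frames), each duration >= 1.
    Assumes linear interpolation between segment centres, held constant
    before the first centre and after the last one.
    """
    phones = ["sil"] + [p for p, _ in segments] + ["sil"]
    targets = [
        select_vector(phones[i], phones[i + 1], phones[i + 2])
        for i in range(len(segments))
    ]

    # Centre position (in frames) of each segment.
    centres: List[float] = []
    start = 0
    for _, duration in segments:
        centres.append(start + (duration - 1) / 2.0)
        start += duration
    total_frames = start

    frames: List[Vector] = []
    for t in range(total_frames):
        if t <= centres[0]:
            frames.append(list(targets[0]))
        elif t >= centres[-1]:
            frames.append(list(targets[-1]))
        else:
            # Index of the last centre at or before frame t.
            i = max(k for k, c in enumerate(centres) if c <= t)
            w = (t - centres[i]) / (centres[i + 1] - centres[i])
            frames.append(
                [(1 - w) * a + w * b for a, b in zip(targets[i], targets[i + 1])]
            )
    return frames


if __name__ == "__main__":
    for frame in interpolate([("a", 3), ("t", 3)]):
        print(frame)
```

In a pipeline of the kind the abstract describes, the per-frame vectors produced this way would be combined with the linguistic information to form the neural network inputs (step C); that conversion is not sketched here because the abstract gives no detail on it.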
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.