Method of deriving characteristics values from a speech signal
US6041296A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Apr 21, 1997 |
| Grant date | Mar 21, 2000 |
| Priority date | — |
| Expiry date | Apr 21, 2017 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L25/21
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
In a frequently used speech synthesis for voice output an excitation signal is applied to a number of resonators whose frequency and amplitude are adjusted in accordance with the sound to be produced. These parameters for adjusting the resonators may be gained from natural speech signals. Such parameters gained from natural speech signals may also be used for speech recognition, in which these parameter values are compared with comparison values. According to the invention, the parameters, particularly the formant frequencies, are determined by forming the power density spectrum via discrete frequencies from which autocorrelation coefficients are formed for consecutive frequency segments of the power density spectrum from which, in turn, error values are formed, while the sum of the error values is minimized over all segments and the optimum boundary frequencies of the segments are determined for this minimum. Via the autocorrelation coefficients, the LPC predictor coefficients can then be computed, from which coefficients the formant frequency is computed. The minimum of the error sum for the individual segments is found by way of dynamic programming, in which auxiliary values are…
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.