Modifications in the multi-band excitation (MBE) model for generating high quality speech at low bit rates
US6963833B1 · kind B1 · utility
Assignee
Inventor
Key dates
| Filing date | Oct 26, 2000 |
| Grant date | Nov 8, 2005 |
| Priority date | — |
| Expiry date | Mar 15, 2022 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L25/90
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
The invention relates to improving parameter estimation and speech synthesis. Pursuant to one aspect of the invention, a path of pitch candidates having low errors is tracked to determine a pitch estimate. Pursuant to another aspect of the invention, a number of parameters are used to classify speech segments. Pursuant to another aspect of the invention, a voicing parameter is determined using a threshold value and bands are marked voiced or unvoiced depending on two error functions that compare synthesized voiced and unvoiced spectra to an original speech spectrum. Pursuant to another aspect of the invention a voicing parameter is used to facilitate lower bits for transmitting voicing decisions. Last, pursuant to other aspects of the invention, unvoiced speech is synthesized by incorporating a random generator, and harmonics phases are initialized with a fixed set of values.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.