Perceptual harmonic cepstral coefficients as the front-end for speech recognition
US7756700B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 1, 2008 |
| Grant date | Jul 13, 2010 |
| Priority date | — |
| Expiry date | Sep 14, 2028 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2025/935
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Pitch estimation and classification into voiced, unvoiced and transitional speech were performed by a spectro-temporal auto-correlation technique. A peak picking formula was then employed. A weighing function was then applied to the power spectrum. The harmonics weighted power spectrum underwent mel-scaled band-pass filtering, and the log-energy of the filter's output was discrete cosine transformed to produce cepstral coefficients. A within-filter cubic-root amplitude compression was applied to reduce amplitude variation without compromise of the gain invariance properties.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.