Patent · US Expired

Parametric speech codec for representing synthetic speech in the presence of background noise

US7257535B2 · kind B2 · utility

2Cited by
35References
4Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 28, 2005
Grant dateAug 14, 2007
Priority date
Expiry dateOct 28, 2025

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L25/93
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system and method are provided for processing audio and speech signals using a pitch and voicing dependent spectral estimation algorithm (voicing algorithm) to accurately represent voiced speech, unvoiced speech, and mixed speech in the presence of background noise, and background noise with a single model. The present invention also modifies the synthesis model based on an estimate of the current input signal to improve the perceptual quality of the speech and background noise under a variety of input conditions. The present invention also improves the voicing dependent spectral estimation algorithm robustness by introducing the use of a Multi-Layer Neural Network in the estimation process. The voicing dependent spectral estimation algorithm provides an accurate and robust estimate of the voicing probability under a variety of background noise conditions. This is essential to providing high quality intelligible speech in the presence of background noise.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.