Patent · US Expired

Parametric speech codec for representing synthetic speech in the presence of background noise

US7257535B2 · kind B2 · utility

2Cited by

35References

4Claims

0Family size

Assignee

LUCENT TECHNOLOGIES INC. · US

Inventors

Joseph Gerard Aguilar · Lawrenceville, US
Juin-Hwey Chen · Neshanic Station, US
Wei Wang · Beijing, CN
Robert W. Zopf · Rancho Santa Margarita, US

Key dates

Filing date	Oct 28, 2005
Grant date	Aug 14, 2007
Priority date	—
Expiry date	Oct 28, 2025

Classification

Technology area (CPC G)Physics
CPC primaryG10L25/93
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A system and method are provided for processing audio and speech signals using a pitch and voicing dependent spectral estimation algorithm (voicing algorithm) to accurately represent voiced speech, unvoiced speech, and mixed speech in the presence of background noise, and background noise with a single model. The present invention also modifies the synthesis model based on an estimate of the current input signal to improve the perceptual quality of the speech and background noise under a variety of input conditions. The present invention also improves the voicing dependent spectral estimation algorithm robustness by introducing the use of a Multi-Layer Neural Network in the estimation process. The voicing dependent spectral estimation algorithm provides an accurate and robust estimate of the voicing probability under a variety of background noise conditions. This is essential to providing high quality intelligible speech in the presence of background noise.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.