Patent · US Active

Detecting a user's voice activity using dynamic probabilistic models of speech features

US9378755B2 · kind B2 · utility

0Cited by

3References

20Claims

0Family size

Assignee

Apple Inc. · US

Inventors

Harvey D. Thornburg · Sunnyvale, US
Charles Pascal Clark · Mountain View, US

Key dates

Filing date	Sep 30, 2014
Grant date	Jun 28, 2016
Priority date	—
Expiry date	Nov 14, 2034

Classification

Technology area (CPC G)Physics
CPC primaryG10L25/78
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Method of detecting voice activity starts with by generating probabilistic models that respectively model features of speech dynamically over time. Probabilistic models may model each feature dependent on a past feature and a current state. Features of speech may include a nonstationary signal presence feature, a periodicity feature, and a sparsity feature. Noise suppressor may then perform noise suppression on an acoustic signal to generate a nonstationary signal presence signal and a noise suppressed acoustic signal. An LPC module may then perform residual analysis on the noise suppressed data signal to generate a periodicity signal and a sparsity signal. Inference generator receives the probabilistic models and receives, in real-time, nonstationary signal presence signal, periodicity signal, and sparsity signal. Inference generator may then generate in real time an estimate of voice activity based on the probabilistic models, nonstationary signal presence signal, periodicity signal, and sparsity signal. Other embodiments are also described.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.