Patent · US Active

Detecting a user's voice activity using dynamic probabilistic models of speech features

US9378755B2 · kind B2 · utility

0Cited by
3References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 30, 2014
Grant dateJun 28, 2016
Priority date
Expiry dateNov 14, 2034

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L25/78
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Method of detecting voice activity starts with by generating probabilistic models that respectively model features of speech dynamically over time. Probabilistic models may model each feature dependent on a past feature and a current state. Features of speech may include a nonstationary signal presence feature, a periodicity feature, and a sparsity feature. Noise suppressor may then perform noise suppression on an acoustic signal to generate a nonstationary signal presence signal and a noise suppressed acoustic signal. An LPC module may then perform residual analysis on the noise suppressed data signal to generate a periodicity signal and a sparsity signal. Inference generator receives the probabilistic models and receives, in real-time, nonstationary signal presence signal, periodicity signal, and sparsity signal. Inference generator may then generate in real time an estimate of voice activity based on the probabilistic models, nonstationary signal presence signal, periodicity signal, and sparsity signal. Other embodiments are also described.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.