Patent · US Active

Method and apparatus for voice activity detection

US10381024B2 · kind B2 · utility

2Cited by
0References
19Claims
0Family size

Assignee

Inventors

Key dates

Filing dateApr 27, 2017
Grant dateAug 13, 2019
Priority date
Expiry dateJan 13, 2038

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2025/786
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A voice activity detection system (100) filters audio input frames (102), on a frame=by-frame basis through a gammatone filterbank (104) to generate filtered gammatone output signals (106). A signal energy calculator (108) takes the filtered gammatone output signals and generates a plurality of energy envelopes. Weighting factors are constructed (112) are applied to each of the energy envelopes thereby producing normalized weighted signal (116), in which voice regions are emphasized and noise regions are minimized. An entropy measurement (118) is taken to extract information from the normalized weighted signals (116) and generate an entropy signal (120). The entropy signal (120) is averaged and compared to an adaptive entropy threshold (122), indicative of a noise floor. Decision logic (124) is used to identifying speech and noise from the comparison of the averaged entropy signal to the adaptive entropy threshold.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.