Method and apparatus for voice activity detection
US10381024B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Apr 27, 2017 |
| Grant date | Aug 13, 2019 |
| Priority date | — |
| Expiry date | Jan 13, 2038 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L2025/786
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A voice activity detection system (100) filters audio input frames (102), on a frame-by-frame basis, through a gammatone filterbank (104) to generate filtered gammatone output signals (106). A signal energy calculator (108) takes the filtered gammatone output signals and generates a plurality of energy envelopes. Weighting factors are constructed (112) and applied to each of the energy envelopes, thereby producing normalized weighted signals (116) in which voice regions are emphasized and noise regions are minimized. An entropy measurement (118) is taken to extract information from the normalized weighted signals (116) and generate an entropy signal (120). The entropy signal (120) is averaged and compared to an adaptive entropy threshold (122), indicative of a noise floor. Decision logic (124) is used to identify speech and noise from the comparison of the averaged entropy signal to the adaptive entropy threshold.
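The processing chain described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and it rests on several assumptions the abstract does not confirm. The function names (`gammatone_ir`, `band_energies`, `band_entropy`, `detect`) are hypothetical. The filter bandwidth uses the commonly cited Glasberg-Moore ERB formula. The weighting factors are passed in by the caller, since the abstract does not say how they are constructed. A frame is treated as speech when the averaged entropy falls below the threshold by a margin, and the threshold tracks the averaged entropy only during non-speech frames. The `smooth`, `adapt`, and `margin` values are illustrative.

```python
import math

def gammatone_ir(fc, fs, n_taps=128, order=4):
    """Unit-energy impulse response of a gammatone filter centred at fc Hz."""
    # Bandwidth from the Glasberg-Moore ERB formula (assumed, not from the patent).
    b = 1.019 * 24.7 * (4.37 * fc / 1000.0 + 1.0)
    ir = [(n / fs) ** (order - 1)
          * math.exp(-2.0 * math.pi * b * n / fs)
          * math.cos(2.0 * math.pi * fc * n / fs)
          for n in range(n_taps)]
    norm = math.sqrt(sum(h * h for h in ir))
    return [h / norm for h in ir]

def band_energies(frame, bank):
    """Filter one frame through every band (zero initial state) and sum squares."""
    energies = []
    for ir in bank:
        out = [sum(ir[k] * frame[n - k] for k in range(min(n + 1, len(ir))))
               for n in range(len(frame))]
        energies.append(sum(y * y for y in out))
    return energies

def band_entropy(energies, weights):
    """Shannon entropy of the weighted band energies, normalized to sum to 1."""
    weighted = [w * e for w, e in zip(weights, energies)]
    total = sum(weighted)
    if total <= 0.0:
        # Silent frame: treat as maximally flat (an assumption of this sketch).
        return math.log(len(weighted))
    return -sum((x / total) * math.log(x / total) for x in weighted if x > 0.0)

def detect(energy_frames, weights, smooth=0.5, adapt=0.9, margin=0.3):
    """Return one speech/non-speech flag per frame of band energies."""
    avg = None        # running average of the entropy signal
    threshold = None  # adaptive entropy threshold (noise-floor estimate)
    flags = []
    for energies in energy_frames:
        h = band_entropy(energies, weights)
        avg = h if avg is None else smooth * avg + (1.0 - smooth) * h
        if threshold is None:
            threshold = avg  # assumes the first frame contains only noise
        # Assumed direction: peaky (low-entropy) band energy indicates speech.
        speech = avg < threshold - margin
        if not speech:
            # Update the noise-floor estimate only on non-speech frames.
            threshold = adapt * threshold + (1.0 - adapt) * avg
        flags.append(speech)
    return flags
```

A caller would build the filterbank once, for example `bank = [gammatone_ir(fc, 8000) for fc in (500, 1000, 2000)]`, compute `band_energies(frame, bank)` for each incoming frame, and pass the resulting list to `detect` together with one weight per band. Because the entropy is averaged across frames, the decision in this sketch lags transitions by about a frame.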
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.