Model-based voice activity detection system and method using a log-likelihood ratio and pitch
US6615170B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 7, 2000 |
| Grant date | Sep 2, 2003 |
| Priority date | — |
| Expiry date | Mar 7, 2020 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L25/78
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A system and method for voice activity detection, in accordance with the invention, includes the steps of inputting data including frames of speech and noise, and deciding if the frames of the input data include speech or noise by employing a log-likelihood ratio test statistic and pitch. The frames of the input data are tagged based on the log-likelihood ratio test statistic and pitch characteristics of the input data as being most likely noise or most likely speech. The tags are counted in a plurality of frames to determine if the input data is speech or noise.
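The tag-then-count flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how the log-likelihood ratio and pitch are combined, so the logical OR below is an assumption, as are the function names `tag_frames` and `decide_by_count`, the threshold, the window length, and the minimum count. The per-frame log-likelihoods and pitch flags are assumed to come from models and a pitch tracker that are not shown here.

```python
from typing import List, Sequence


def tag_frames(
    speech_loglik: Sequence[float],
    noise_loglik: Sequence[float],
    has_pitch: Sequence[bool],
    llr_threshold: float = 0.0,  # assumed value, not from the patent
) -> List[bool]:
    """Tag each frame as most likely speech (True) or noise (False)."""
    tags = []
    for ls, ln, pitched in zip(speech_loglik, noise_loglik, has_pitch):
        # Log-likelihood ratio: log p(frame | speech) - log p(frame | noise).
        llr = ls - ln
        # Assumption: a frame is tagged speech if either cue fires.
        tags.append(llr > llr_threshold or pitched)
    return tags


def decide_by_count(
    tags: Sequence[bool],
    window: int = 3,      # assumed number of frames to count over
    min_speech: int = 2,  # assumed minimum speech tags in the window
) -> List[bool]:
    """Declare speech at frame i if enough recent frames are tagged speech."""
    decisions = []
    for i in range(len(tags)):
        start = max(0, i - window + 1)
        speech_count = sum(tags[start : i + 1])
        decisions.append(speech_count >= min_speech)
    return decisions


# Made-up example values for six frames.
speech_ll = [-10.0, -9.0, -3.0, -2.0, -2.0, -8.0]
noise_ll = [-4.0, -4.0, -6.0, -7.0, -6.0, -3.0]
pitch = [False, False, True, True, False, False]

tags = tag_frames(speech_ll, noise_ll, pitch)
decisions = decide_by_count(tags)
```

Counting tags over several frames smooths the decision, so a single mis-tagged frame does not flip the output between speech and noise.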
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.