Voice detection in audio signals
US6321194A · kind A · utility
Assignee
Inventor
Key dates
| Filing date | Apr 27, 1999 |
| Grant date | Nov 20, 2001 |
| Priority date | — |
| Expiry date | Apr 27, 2019 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L25/33
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
The presence of a voice in an audio signal is detected by sampling frequency components of the audio signal during a window that starts when the power of the audio signal reaches a predetermined threshold and stops when the power drops below that threshold. An array of elements is generated from the sampled frequency components. Each element in the array corresponds to a time-based sum of frequency components. Whether the audio signal corresponds to a voice is determined using one or more values calculated from the generated array. Each value may correspond either to a frequency-based sum of array elements or to the window. The calculated values are analyzed using fuzzy logic, which generates a measure of the likelihood that the audio signal is a voice.
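The steps in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the frame-based processing, the naive DFT, the choice of band, the ramp-shaped membership functions, the fuzzy AND via `min`, and every name and default value (`find_window`, `time_summed_array`, `voice_likelihood`, `band`, `min_frames`, `full_frames`, the 0.3 and 0.7 breakpoints) are hypothetical and are not taken from the patent.

```python
import cmath
import math


def frame_power(frame):
    """Mean squared amplitude of one frame of samples."""
    return sum(x * x for x in frame) / len(frame)


def dft_magnitudes(frame, n_bins):
    """Magnitudes of the first n_bins DFT bins (naive O(n * n_bins) DFT)."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(frame)))
        for k in range(n_bins)
    ]


def find_window(frames, threshold):
    """Return (start, stop) frame indices of the first run whose power stays
    at or above threshold, or None if the threshold is never reached."""
    start = None
    for i, frame in enumerate(frames):
        power = frame_power(frame)
        if start is None and power >= threshold:
            start = i
        elif start is not None and power < threshold:
            return start, i
    return (start, len(frames)) if start is not None else None


def time_summed_array(frames, n_bins):
    """One element per frequency bin: that bin's magnitude summed over time."""
    array = [0.0] * n_bins
    for frame in frames:
        for k, magnitude in enumerate(dft_magnitudes(frame, n_bins)):
            array[k] += magnitude
    return array


def ramp(x, lo, hi):
    """Assumed fuzzy membership: 0 at or below lo, 1 at or above hi, linear between."""
    return min(1.0, max(0.0, (x - lo) / (hi - lo)))


def voice_likelihood(frames, threshold, band=(1, 4), n_bins=8,
                     min_frames=1, full_frames=3):
    """Combine two values into a likelihood in [0, 1] (all rules are assumptions)."""
    window = find_window(frames, threshold)
    if window is None:
        return 0.0
    start, stop = window
    array = time_summed_array(frames[start:stop], n_bins)
    total = sum(array)
    if total == 0.0:
        return 0.0
    # Value 1: a frequency-based sum of array elements, as a share of the total.
    band_fraction = sum(array[band[0]:band[1]]) / total
    # Value 2: a property of the window itself, here its length in frames.
    duration = stop - start
    in_band = ramp(band_fraction, 0.3, 0.7)
    long_enough = ramp(duration, min_frames, full_frames)
    return min(in_band, long_enough)  # fuzzy AND of the two memberships


if __name__ == "__main__":
    tone = [math.sin(2 * math.pi * 2 * i / 16) for i in range(16)]
    silence = [0.0] * 16
    frames = [silence] * 2 + [tone] * 4 + [silence] * 2
    print(find_window(frames, 0.1), voice_likelihood(frames, 0.1))
```

In this sketch, a synthetic tone surrounded by silence opens the window on the first loud frame and closes it on the first quiet one. A real detector would replace the naive DFT with an FFT and tune the band and membership breakpoints against recorded speech.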
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.