Multi-feature speech/music discrimination system
US6570991B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 18, 1996 |
| Grant date | May 27, 2003 |
| Priority date | — |
| Expiry date | Dec 18, 2016 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L25/51
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A speech/music discriminator employs data from multiple features of an audio signal as input to a classifier. Some of the feature data is determined from individual frames of the audio signal, and other input data is based upon variations of a feature over several frames, to distinguish the changes in voiced and unvoiced components of speech from the more constant characteristics of music. Several different types of classifiers for labeling test points on the basis of the feature data are disclosed. A preferred set of classifiers is based upon variations of a nearest-neighbor approach, including a K-d tree spatial partitioning technique.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.