Method and system for distinguishing speech from music in a digital audio signal in real time
US7191128B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 21, 2003 |
| Grant date | Mar 13, 2007 |
| Priority date | — |
| Expiry date | Jul 13, 2025 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L25/78
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
The present invention relates to a method and system for distinguishing speech from music in a digital audio signal in real time. The method operates on sound segments that a segmentation unit has segmented from the input signal of a digital sound processing system on the basis of the homogeneity of their properties, and comprises the steps of: (a) framing the input signal into a sequence of overlapped frames by a windowing function; (b) calculating a frame spectrum for every frame by FFT; (c) calculating a segment harmony measure on the basis of the frame spectrum sequence; (d) calculating a segment noise measure on the basis of the frame spectrum sequence; (e) calculating a segment tail measure on the basis of the frame spectrum sequence; (f) calculating a segment drag out measure on the basis of the frame spectrum sequence; (g) calculating a segment rhythm measure on the basis of the frame spectrum sequence; and (h) making the distinguishing decision based on the calculated characteristics.
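Steps (a) and (b) of the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not name a window, a frame length, or an overlap, so the Hann window, the 256-sample frame, and the 128-sample hop below are assumptions, and the function names are hypothetical. The harmony, noise, tail, drag out, and rhythm measures of steps (c) to (g) are left out because the abstract does not define them. A small pure-Python radix-2 FFT is used only to keep the example free of third-party dependencies.

```python
import cmath
import math


def hann_window(n):
    """Return an n-point Hann window (an assumed windowing function), n >= 2."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / (n - 1)) for i in range(n)]


def frame_signal(samples, frame_len, hop):
    """Step (a): split samples into overlapped, windowed frames.

    Consecutive frames overlap whenever hop < frame_len. Trailing samples
    that do not fill a whole frame are dropped.
    """
    window = hann_window(frame_len)
    frames = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frames.append([samples[start + i] * window[i] for i in range(frame_len)])
    return frames


def fft(x):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    even = fft(x[0::2])
    odd = fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def frame_spectra(samples, frame_len=256, hop=128):
    """Steps (a)-(b): magnitude spectrum (bins 0..frame_len/2) of every frame."""
    return [
        [abs(c) for c in fft(frame)[: frame_len // 2 + 1]]
        for frame in frame_signal(samples, frame_len, hop)
    ]
```

Calling `frame_spectra(samples)` on a list of floats returns one magnitude spectrum per frame. That sequence of spectra is the input on which steps (c) to (g) compute the segment measures.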
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.