Method for automatic analysis of audio including music and speech
US6542869B1 · kind B1 · utility
Assignee
Inventor
Key dates
| Filing date | May 11, 2000 |
| Grant date | Apr 1, 2003 |
| Priority date | — |
| Expiry date | May 11, 2020 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L19/02
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method for determining points of change or novelty in an audio signal measures the self similarity of components of the audio signal. For each time window in an audio signal, a formula is used to determine a vector parameterization value. The self-similarity as well as cross-similarity between each of the parameterization values is then determined for all past and future window regions. A significant point of novelty or change will have a high self-similarity in the past and future, and a low cross-similarity. The extent of the time difference between “past” and “future” can be varied to change the scale of the system so that, for example, individual musical notes can be found using a short time extent while longer events, such as musical themes or changing of speakers, can be identified by considering windows further into the past or future. The result is a measure of the degree of change, or how novel the source audio is at any time. The method can be used in a wide variety of applications, including segmenting or indexing for classification and retrieval, beat tracking, and summarizing of speech or music.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.