Patent · US Active

Multichannel audio speech classification

US11900961B2 · kind B2 · utility

Cited by: 0
References: 5
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: May 31, 2022
Grant date: Feb 13, 2024
Priority date
Expiry date: May 31, 2042

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L2021/02087
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Examples of the present disclosure describe systems and methods for multichannel audio speech classification. In examples, an audio signal comprising multiple audio channels is received at a processing device. Each of the audio channels in the audio signal is transcoded to a predefined audio format. For each of the transcoded audio channels, an average power value is calculated for one or more data windows in the audio signal. A correlation value is calculated between the average power value for each audio channel and the combined average power value of the other audio channels in the audio signal. Each of the correlation values (or an aggregated correlation value for the audio channels) is then compared against a threshold value to determine whether the audio signal is to be classified as a speech-based communication. Based on the classification, an action associated with the audio signal may be performed.
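
The power, correlation, and threshold steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Several details are assumptions the abstract does not state: the function names, non-overlapping windows, Pearson correlation as the correlation measure, the mean as the aggregation, a threshold of 0.0, and the decision direction (an aggregated correlation below the threshold is treated as speech, on the reasoning that speakers on separate channels tend to alternate). The transcoding step is skipped; each channel is assumed to be already decoded into an equal-length list of float samples.

```python
import math
from typing import List, Sequence


def window_power(samples: Sequence[float], window: int) -> List[float]:
    """Average power (mean squared sample) of each full, non-overlapping window."""
    return [
        sum(s * s for s in samples[i:i + window]) / window
        for i in range(0, len(samples) - window + 1, window)
    ]


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation; returns 0.0 for empty or constant series."""
    n = len(x)
    if n == 0:
        return 0.0
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / math.sqrt(vx * vy)


def channel_correlations(channels: Sequence[Sequence[float]], window: int) -> List[float]:
    """Correlate each channel's power series with the combined power of the others."""
    powers = [window_power(ch, window) for ch in channels]
    n_windows = len(powers[0])
    result = []
    for i, own in enumerate(powers):
        # Combined power of all other channels, per window (sum is an assumption;
        # Pearson correlation is unchanged if a mean is used instead).
        others = [
            sum(p[w] for j, p in enumerate(powers) if j != i)
            for w in range(n_windows)
        ]
        result.append(pearson(own, others))
    return result


def is_speech(channels: Sequence[Sequence[float]], window: int = 4,
              threshold: float = 0.0) -> bool:
    """Assumed rule: aggregated (mean) correlation below threshold means speech."""
    corrs = channel_correlations(channels, window)
    return sum(corrs) / len(corrs) < threshold


# Two channels whose activity alternates, as in a two-party call.
caller = [1.0] * 4 + [0.0] * 4 + [1.0] * 4 + [0.0] * 4
agent = [0.0] * 4 + [1.0] * 4 + [0.0] * 4 + [1.0] * 4
print(channel_correlations([caller, agent], window=4))
print(is_speech([caller, agent]))    # alternating activity
print(is_speech([caller, caller]))   # identical channels
```

In this toy input the alternating channels have opposite power envelopes, so the sketch classifies them as speech, while two identical channels are fully correlated and are not. A real system would need a window length and threshold tuned on actual recordings.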

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.