Patent · US Active

Real-time class recognition for an audio stream

US11024291B2 · kind B2 · utility

Cited by: 4
References: 15
Claims: 33
Family size: 0

Assignee

Inventors

Key dates

Filing date: Mar 27, 2019
Grant date: Jun 1, 2021
Priority date:
Expiry date: Jun 24, 2039

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L15/00
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

In an embodiment, the disclosed technologies include automatically recognizing speech content of an audio stream that may contain multiple different classes of speech content, by receiving, by an audio capture device, an audio stream; outputting, by one or more classifiers, in response to an inputting to the one or more classifiers of digital data that has been extracted from the audio stream, score data; where a score of the score data indicates a likelihood that a particular time segment of the audio stream contains speech of a particular class; where the one or more classifiers use one or more machine-learned models that have been trained to recognize audio of one or more particular classes to determine the score data; using a sliding time window process, selecting particular scores from the score data; using the selected particular scores, determining and outputting one or more decisions as to whether one or more particular time segments of the audio stream contain speech of one or more particular classes; where the one or more decisions are outputted within a real-time time interval of the receipt of the audio stream; where the one or more decisions are used by downstream proc…
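
The sliding time window step described in the abstract can be illustrated with a minimal sketch. It is not the patented method. The abstract does not say how scores are selected within the window or how a decision is derived from them, so the max-per-class selection rule, the fixed threshold, the window length, and every name below (sliding_window_decisions, window_size, threshold, the class labels) are assumptions made for illustration only. The classifier is also assumed to have already produced one score dictionary per time segment.

```python
from collections import deque
from typing import Dict, Iterable, Iterator, List, Tuple


def sliding_window_decisions(
    score_stream: Iterable[Dict[str, float]],
    window_size: int = 3,      # hypothetical: number of segments per window
    threshold: float = 0.5,    # hypothetical: minimum score for a positive decision
) -> Iterator[Tuple[int, List[str]]]:
    """Yield (segment_index, classes decided present) as scores arrive.

    Each item of score_stream maps a class label to the likelihood that
    the corresponding time segment contains speech of that class.
    """
    window: deque = deque(maxlen=window_size)  # keeps only the most recent segments
    for index, scores in enumerate(score_stream):
        window.append(scores)

        # Assumed selection rule: keep the highest score per class in the window.
        selected: Dict[str, float] = {}
        for segment_scores in window:
            for label, score in segment_scores.items():
                selected[label] = max(selected.get(label, 0.0), score)

        # Assumed decision rule: a class is present if its selected score
        # reaches the threshold.
        decided = sorted(label for label, s in selected.items() if s >= threshold)
        yield index, decided


if __name__ == "__main__":
    # Hypothetical per-segment classifier output for two speech classes.
    stream = [
        {"english": 0.9, "spanish": 0.1},
        {"english": 0.4, "spanish": 0.2},
        {"english": 0.3, "spanish": 0.7},
        {"english": 0.2, "spanish": 0.8},
    ]
    for index, classes in sliding_window_decisions(stream, window_size=2, threshold=0.6):
        print(index, classes)
```

Because the generator emits a decision as soon as each segment's scores arrive, the output lags the input by at most one segment, which loosely mirrors the abstract's requirement that decisions be output within a real-time interval of receiving the audio.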

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.