Patent · US Active

Real-time class recognition for an audio stream

US11024291B2 · kind B2 · utility

4Cited by

15References

33Claims

0Family size

Assignee

SRI International · US

Inventors

Diego Castan Lavilla · Mountain View, US
Harry Bratt · Mountain View, US
Mitchell Leigh McLaren · Dajarra, AU

Key dates

Filing date	Mar 27, 2019
Grant date	Jun 1, 2021
Priority date	—
Expiry date	Jun 24, 2039

Classification

Technology area (CPC G)Physics
CPC primaryG10L15/00
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

In an embodiment, the disclosed technologies include automatically recognizing speech content of an audio stream that may contain multiple different classes of speech content, by receiving, by an audio capture device, an audio stream; outputting, by one or more classifiers, in response to an inputting to the one or more classifiers of digital data that has been extracted from the audio stream, score data; where a score of the score data indicates a likelihood that a particular time segment of the audio stream contains speech of a particular class; where the one or more classifiers use one or more machine-learned models that have been trained to recognize audio of one or more particular classes to determine the score data; using a sliding time window process, selecting particular scores from the score data; using the selected particular scores, determining and outputting one or more decisions as to whether one or more particular time segments of the audio stream contain speech of one or more particular classes; where the one or more decisions are outputted within a real-time time interval of the receipt of the audio stream; where the one or more decisions are used by downstream proc…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.