Patent · US Active

Automatic smoothed captioning of non-speech sounds from audio

US10037313B2 · kind B2 · utility

3Cited by

0References

17Claims

0Family size

Assignee

Google LLC · US

Inventors

Fangzhou Wang · Beijing, CN
Sourish Chaudhuri · San Francisco, US
Daniel Ellis · New York, US
Nathan Reale · San Francisco, US

Key dates

Filing date	Aug 23, 2016
Grant date	Jul 31, 2018
Priority date	—
Expiry date	Nov 11, 2036

Classification

Technology area (CPC G)Physics
CPC primaryG10L2025/783
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A content server accessing an audio stream, and inputs portions of the audio stream into one or more non-speech classifiers for classification, the non-speech classifiers generating, for portions of the audio stream, a set of raw scores representing likelihoods that the respective portion of the audio stream includes an occurrence of a particular class of non-speech sounds associated with each of the non-speech classifiers. The content server generates binary scores for the sets of raw scores, the binary scores generated based on a smoothing of a respective set of raw scores. The content server applies a set of non-speech captions to portions of the audio stream in time, each of the sets of non-speech captions based on a different one of the set binary scores of the corresponding portion of the audio stream.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.