Patent · US Active

Language agnostic automated voice activity detection

US11205445B1 · kind B1 · utility

5Cited by
0References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 10, 2019
Grant dateDec 21, 2021
Priority date
Expiry dateFeb 4, 2040

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L25/30
  • WIPO fieldAudio-visual technology
  • WIPO sectorElectrical engineering

Abstract

Systems, methods, and computer-readable media are disclosed for systems and methods for language agnostic automated voice activity detection. Example methods may include determining an audio file associated with video content, generating a number of audio segments using the audio file, the plurality of audio segments including a first segment and a second segment, where the first segment and the second segment are consecutive segments. Example methods may include determining, using a Gated Recurrent Unit neural network, that the first segment includes first voice activity, determining, using the Gated Recurrent Unit neural network, that the second segment includes second voice activity, and determining that voice activity is present between a first timestamp associated with the first segment and a second timestamp associated with the second segment.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.