Patent · US Active

Language agnostic automated voice activity detection

US11869537B1 · kind B1 · utility

0Cited by
3References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 10, 2021
Grant dateJan 9, 2024
Priority date
Expiry dateMar 9, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L25/30
  • WIPO fieldAudio-visual technology
  • WIPO sectorElectrical engineering

Abstract

Systems, methods, and computer-readable media are disclosed for systems and methods for language agnostic automated voice activity detection. Example methods may include determining an audio file associated with video content, generating audio segments using the audio file, the audio segments including a first segment and a second segment, and determining that the first segment includes first voice activity. Methods may include determining that the second segment comprises second voice activity, determining that voice activity is present between a first timestamp associated with the first segment and a second timestamp associated with the second segment, and generating text data representing the voice activity that is present between the first timestamp and the second timestamp.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.