Patent · US Active

Speech activity detection using dual sensory based learning

US11451742B2 · kind B2 · utility

Cited by	0

References	2

Claims	19

Family size	0

Assignee

BlackBerry Limited · CA

Inventor

Shiladitya Sircar · Ottawa, CA

Key dates

Filing date	Dec 4, 2020
Grant date	Sep 20, 2022
Priority date	—
Expiry date	Dec 4, 2040

Classification

Technology area (CPC H)	Electricity
CPC primary	H04M3/569
WIPO field	Audio-visual technology
WIPO sector	Electrical engineering

Abstract

A dual sensory input speech detection method includes receiving, at a first time, a first video image input of a conference participant of the video conference and a first audio input of the conference participant; communicating the first video image input to the video conference; identifying the first video image input as a first facial image of the conference participant; determining, based on the first facial image, the first video image input indicates the conference participant is in a speaking state; identifying the first audio input as a first speech sound; determining, while in the speaking state, the first speech sound originates from the conference participant; and communicating the first audio input to an audio output for the video conference.
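
The gating logic described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the `Frame` type, its field names, and the `gate_audio` function are hypothetical, and the three boolean flags stand in for detectors (face identification, speaking-state detection, speech-sound identification) that the abstract names but does not specify. Returning `None` to suppress audio outside the speaking state is likewise an assumption; the abstract only states the case in which audio is forwarded.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Frame:
    """One time-aligned capture of a participant (hypothetical structure)."""
    has_face: bool     # video image identified as the participant's facial image
    lips_moving: bool  # facial image indicates a speaking state
    is_speech: bool    # audio input identified as a speech sound
    audio: bytes       # raw audio payload captured at the same time


def gate_audio(frame: Frame) -> Optional[bytes]:
    """Return the audio to forward to the conference, or None to hold it back.

    Audio is attributed to the participant only when a speech sound
    coincides with a visually detected speaking state.
    """
    speaking_state = frame.has_face and frame.lips_moving
    if speaking_state and frame.is_speech:
        return frame.audio
    # Assumption: audio that fails either check is not forwarded.
    return None


# Usage: speech heard while the participant is visibly speaking is forwarded;
# speech heard while the participant's lips are still is not.
forwarded = gate_audio(Frame(True, True, True, b"hello"))
suppressed = gate_audio(Frame(True, False, True, b"background voice"))
```

In this sketch the video input is not gated at all, matching the abstract's statement that the video image is communicated to the conference independently of the audio decision.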

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.