Patent · US Active

Multi-register-based speech detection method and related apparatus, and storage medium

US12051441B2 · kind B2 · utility

0Cited by

6References

20Claims

0Family size

Assignee

TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED · CN

Inventors

Jimeng Zheng · Lo Wu, CN
Lianwu Chen · Beijing, CN
Weiwei Li · Langfang, CN
Zhiyi Duan · Lo Wu, CN
Meng Yu · Lo Wu, CN
Dan Su · Nanhu, CN
Kaiyu Jiang · 红钢城街道, CN

Key dates

Filing date	Sep 13, 2022
Grant date	Jul 30, 2024
Priority date	—
Expiry date	Sep 13, 2042

Classification

Technology area (CPC G)Physics
CPC primaryG10L2021/02166
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

This application discloses a multi-sound area-based speech detection method and related apparatus, and a storage medium, which is applied to the field of artificial intelligence. The method includes: obtaining sound area information corresponding to N sound areas including multiple users speaking simultaneously; generating a control signal corresponding to each target detection sound area according to user information corresponding to the target detection sound area; processing multi-user speech input signals by using the control signals, to obtain a speech output signal corresponding to each target detection sound area; generating a speech detection result of the target detection sound area according to the speech output signal corresponding to the target detection sound area; and selecting, among the multiple users, a main speaker based on the user information, the speech output signals and speech detection results of multiple users in the N sound areas.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.