Patent · US Active

Speech separation method, electronic device, chip, and computer- readable storage medium

US12334092B2 · kind B2 · utility

0Cited by

6References

20Claims

0Family size

Assignees

Inventors

Henghui Lu · Beijing, CN
Lei Qin · Natick, US
Peng Zhang · Espoo, FI
Jiaming Xu · Beijing, CN
Bo Xu · Beijing, CN

Key dates

Filing date	Aug 24, 2021
Grant date	Jun 17, 2025
Priority date	—
Expiry date	Feb 23, 2042

Classification

Technology area (CPC G)Physics
CPC primaryG10L25/57
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A speech separation method is provided, and relates to the field of speech. The method includes: obtaining, in a speaking process of a user, audio information including a user speech and video information including a user face; coding the audio information to obtain a mixed acoustic feature; extracting a visual semantic feature of the user from the video information; inputting the mixed acoustic feature and the visual semantic feature into a preset visual speech separation network to obtain an acoustic feature of the user; and decoding the acoustic feature of the user to obtain a speech signal of the user. An electronic device, a chip, and a computer-readable storage medium are provided.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.