Speech separation method, electronic device, chip, and computer- readable storage medium
US12334092B2 · kind B2 · utility
Assignees
Inventors
Key dates
| Filing date | Aug 24, 2021 |
| Grant date | Jun 17, 2025 |
| Priority date | — |
| Expiry date | Feb 23, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L25/57
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A speech separation method is provided, and relates to the field of speech. The method includes: obtaining, in a speaking process of a user, audio information including a user speech and video information including a user face; coding the audio information to obtain a mixed acoustic feature; extracting a visual semantic feature of the user from the video information; inputting the mixed acoustic feature and the visual semantic feature into a preset visual speech separation network to obtain an acoustic feature of the user; and decoding the acoustic feature of the user to obtain a speech signal of the user. An electronic device, a chip, and a computer-readable storage medium are provided.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.