Patent · US Active

Speech separation model training method and apparatus, storage medium and computer device

US11908455B2 · kind B2 · utility

0Cited by

3References

20Claims

0Family size

Assignee

TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED · CN

Inventors

Jun Wang · Lo Wu, CN
Wingyip LAM · Lo Wu, CN
Dan Su · Nanhu, CN
Dong YU · Zhejiang, CN

Key dates

Filing date	Feb 15, 2022
Grant date	Feb 20, 2024
Priority date	—
Expiry date	Sep 3, 2042

Classification

Technology area (CPC Y)Emerging Cross-Sectional Technologies
CPC primaryY02T10/40
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A speech separation model training method and apparatus, a computer-readable storage medium, and a computer device are provided, the method including: obtaining first audio and second audio, the first audio including target audio and having corresponding labeled audio, and the second audio including noise audio. obtaining an encoding model, an extraction model, and an initial estimation model; performing unsupervised training on the encoding model, the extraction model, and the estimation model according to the second audio, and adjusting model parameters of the extraction model and the estimation model; performing supervised training on the encoding model and the extraction model according to the first audio and the labeled audio corresponding to the first audio, and adjusting a model parameter of the encoding model; continuously performing the unsupervised training and the supervised training, so that the unsupervised training and the supervised training overlap, and the training is not finished until a training stop condition is met.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.