Patent · US Active

Speech recognition

US12183362B1 · kind B1 · utility

Cited by	0

References	3

Claims	20

Family size	0

Assignee

MASHANG CONSUMER FINANCE CO., LTD. · CN

Inventors

Qinglin Meng · Sunnyvale, US
Bin Yang · Pullman, US
Ning Jiang · Honggangcheng Subdistrict, CN
Haiying WU · Chongqing, CN
Quan Lu · Shanghai, CN
Min Liu · Kunshan, CN

Key dates

Filing date	Apr 11, 2024
Grant date	Dec 31, 2024
Priority date	—
Expiry date	Apr 11, 2044

Classification

Technology area (CPC G)	Physics
CPC primary	G10L15/08
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

A speech recognition method. The method includes: performing speech activity detection on speech data to obtain multiple speech segments; determining, for each of the speech segments, the number of speakers involved in that speech segment; for each of at least one speech segment whose determined number is greater than 1: performing speech separation on the speech segment to obtain multiple audio segments; performing speech recognition on each of the audio segments to obtain respective first speech recognition results for the audio segments; performing feature extraction on each of the audio segments to obtain respective voiceprint feature vectors; and performing clustering on the audio segments with respect to the speakers to obtain a clustering result; and obtaining a second speech recognition result for the speech data based on the clustering result and the respective first speech recognition results.
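The flow described in the abstract can be sketched roughly as follows. This is an illustrative outline only, not the patented implementation: every function name (recognize_speech, greedy_cluster, the toy_* stand-ins), the cosine threshold of 0.8, and the greedy clustering rule are assumptions introduced here. The abstract does not say how single-speaker segments are handled; the sketch assumes they pass through as one audio segment. Real voice activity detection, speaker counting, separation, recognition, and voiceprint models would replace the toy callables.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def greedy_cluster(vectors, threshold=0.8):
    """Hypothetical clustering rule: join the first cluster whose
    representative is similar enough, otherwise open a new cluster."""
    representatives, labels = [], []
    for v in vectors:
        for i, rep in enumerate(representatives):
            if cosine(v, rep) >= threshold:
                labels.append(i)
                break
        else:
            representatives.append(v)
            labels.append(len(representatives) - 1)
    return labels

def recognize_speech(speech, vad, count_speakers, separate, asr, embed,
                     cluster=greedy_cluster):
    """Return (start, speaker_label, text) tuples for the whole recording."""
    pieces = []  # (segment start time, audio segment)
    for start, segment in vad(speech):
        if count_speakers(segment) > 1:
            # Overlapping speech: split into one audio segment per speaker.
            pieces.extend((start, p) for p in separate(segment))
        else:
            # Assumption: single-speaker segments are used unchanged.
            pieces.append((start, segment))
    texts = [asr(p) for _, p in pieces]          # first recognition results
    voiceprints = [embed(p) for _, p in pieces]  # voiceprint feature vectors
    labels = cluster(voiceprints)                # clustering by speaker
    # Second result: first results regrouped under the clustered speakers.
    return [(start, f"speaker_{label}", text)
            for (start, _), label, text in zip(pieces, labels, texts)]

# Toy stand-ins: a "segment" is a list of (voiceprint, text) utterances.
def toy_vad(speech):
    return speech

def toy_count_speakers(segment):
    return len(segment)

def toy_separate(segment):
    return [[utterance] for utterance in segment]

def toy_asr(piece):
    return piece[0][1]

def toy_embed(piece):
    return piece[0][0]

toy_speech = [
    (0.0, [((1.0, 0.0), "hello")]),
    (1.5, [((1.0, 0.1), "how are you"), ((0.0, 1.0), "fine thanks")]),
]

if __name__ == "__main__":
    for row in recognize_speech(toy_speech, toy_vad, toy_count_speakers,
                                toy_separate, toy_asr, toy_embed):
        print(row)
```

Passing each stage in as a callable keeps the sketch independent of any particular speech toolkit, so a stage can be swapped without touching the control flow.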

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.