Patent · US Active

Speech recognition method and apparatus, and computer-readable storage medium

US12217739B2 · kind B2 · utility

0Cited by

3References

20Claims

0Family size

Assignee

JINGDONG TECHNOLOGY HOLDING CO., LTD. · CN

Inventor

Li Fu · 东风镇, CN

Key dates

Filing date	Apr 30, 2020
Grant date	Feb 4, 2025
Priority date	—
Expiry date	Dec 20, 2040

Classification

Technology area (CPC G)Physics
CPC primaryG10L25/30
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A speech recognition method, including acquiring first linear frequency spectrums corresponding to audios to be trained with different sampling rates; determining the maximum sampling rate and other sampling rates; determining the maximum frequency domain sequence number of the first linear frequency spectrums as a first frequency domain sequence number and a second frequency domain sequence number; in the first linear frequency spectrums corresponding to the other sampling rate, configuring amplitude values corresponding to each frequency domain sequence number that is greater than the first frequency domain sequence number and less than or equal to the second frequency domain sequence number to be zero to obtain second linear frequency spectrums; determining first speech features and second voice features; and using the first speech features and the second speech features to train a machine learning model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.