Patent · US Active

Speech recognition method and apparatus, and computer-readable storage medium

US12217739B2 · kind B2 · utility

0Cited by
3References
20Claims
0Family size

Assignee

Inventor

Key dates

Filing dateApr 30, 2020
Grant dateFeb 4, 2025
Priority date
Expiry dateDec 20, 2040

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L25/30
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A speech recognition method, including acquiring first linear frequency spectrums corresponding to audios to be trained with different sampling rates; determining the maximum sampling rate and other sampling rates; determining the maximum frequency domain sequence number of the first linear frequency spectrums as a first frequency domain sequence number and a second frequency domain sequence number; in the first linear frequency spectrums corresponding to the other sampling rate, configuring amplitude values corresponding to each frequency domain sequence number that is greater than the first frequency domain sequence number and less than or equal to the second frequency domain sequence number to be zero to obtain second linear frequency spectrums; determining first speech features and second voice features; and using the first speech features and the second speech features to train a machine learning model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.