Speech recognition method and apparatus, and computer-readable storage medium
US12217739B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Apr 30, 2020 |
| Grant date | Feb 4, 2025 |
| Priority date | — |
| Expiry date | Dec 20, 2040 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L25/30
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A speech recognition method, including acquiring first linear frequency spectrums corresponding to audios to be trained with different sampling rates; determining the maximum sampling rate and other sampling rates; determining the maximum frequency domain sequence number of the first linear frequency spectrums as a first frequency domain sequence number and a second frequency domain sequence number; in the first linear frequency spectrums corresponding to the other sampling rate, configuring amplitude values corresponding to each frequency domain sequence number that is greater than the first frequency domain sequence number and less than or equal to the second frequency domain sequence number to be zero to obtain second linear frequency spectrums; determining first speech features and second voice features; and using the first speech features and the second speech features to train a machine learning model.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.