Data sorting for generating RNN-T models
US12027153B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 21, 2022 |
| Grant date | Jul 2, 2024 |
| Priority date | — |
| Expiry date | Feb 18, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2015/025
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A computer-implemented method for preparing training data for a speech recognition model is provided including obtaining a plurality of sentences from a corpus, dividing each phoneme in each sentence of the plurality of sentences into three hidden states, calculating, for each sentence of the plurality of sentences, a score based on a variation in duration of the three hidden states of each phoneme in the sentence, and sorting the plurality of sentences by using the calculated scores.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.