Patent · US Active

Data sorting for generating RNN-T models

US12027153B2 · kind B2 · utility

0Cited by
1References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJan 21, 2022
Grant dateJul 2, 2024
Priority date
Expiry dateFeb 18, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2015/025
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A computer-implemented method for preparing training data for a speech recognition model is provided including obtaining a plurality of sentences from a corpus, dividing each phoneme in each sentence of the plurality of sentences into three hidden states, calculating, for each sentence of the plurality of sentences, a score based on a variation in duration of the three hidden states of each phoneme in the sentence, and sorting the plurality of sentences by using the calculated scores.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.