Patent · US Active

Streaming automatic speech recognition with non-streaming model distillation

US11804212B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 16
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jun 15, 2021
Grant date: Oct 31, 2023
Priority date
Expiry date: Oct 28, 2041

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L15/16
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method for training a streaming automatic speech recognition student model includes receiving a plurality of unlabeled student training utterances. The method also includes, for each unlabeled student training utterance, generating a transcription corresponding to the respective unlabeled student training utterance using a plurality of non-streaming automated speech recognition (ASR) teacher models. The method further includes distilling a streaming ASR student model from the plurality of non-streaming ASR teacher models by training the streaming ASR student model using the plurality of unlabeled student training utterances paired with the corresponding transcriptions generated by the plurality of non-streaming ASR teacher models.
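The data flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. The type aliases, the function names `pseudo_label` and `build_distillation_set`, and the toy teachers are all hypothetical. In particular, the abstract does not say how the outputs of several teachers are combined, so the majority vote over whole transcripts used here is only an assumed stand-in.

```python
from collections import Counter
from typing import Callable, List, Sequence, Tuple

# Hypothetical types for illustration only.
Utterance = Sequence[float]            # raw audio samples of one unlabeled utterance
Teacher = Callable[[Utterance], str]   # non-streaming recognizer that sees the full utterance


def pseudo_label(utterance: Utterance, teachers: Sequence[Teacher]) -> str:
    """Produce one transcription for an utterance from several teachers.

    Assumption: a plain majority vote over whole transcripts, with ties
    resolved in favor of the earliest teacher. The abstract does not
    specify the combination rule.
    """
    hypotheses = [teacher(utterance) for teacher in teachers]
    counts = Counter(hypotheses)
    best = max(counts.values())
    # Return the first hypothesis (in teacher order) that reaches the top count.
    return next(h for h in hypotheses if counts[h] == best)


def build_distillation_set(
    utterances: Sequence[Utterance], teachers: Sequence[Teacher]
) -> List[Tuple[Utterance, str]]:
    """Pair each unlabeled utterance with its teacher-generated transcription."""
    return [(u, pseudo_label(u, teachers)) for u in utterances]


# Toy stand-ins for non-streaming ASR teachers (not real models).
def teacher_a(u: Utterance) -> str:
    return "hello world" if len(u) > 2 else "hi"


def teacher_b(u: Utterance) -> str:
    return "hello world" if len(u) > 2 else "high"


def teacher_c(u: Utterance) -> str:
    return "hello word" if len(u) > 2 else "hi"


unlabeled = [[0.1, 0.2, 0.3], [0.5]]
pairs = build_distillation_set(unlabeled, [teacher_a, teacher_b, teacher_c])
```

The resulting `pairs` list plays the role of the training set in the abstract: each unlabeled utterance paired with a teacher-generated transcription. A streaming student model would then be trained on these pairs with an ordinary supervised ASR training loop, which is omitted here.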

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.