Patent · US Active

Deliberation by text-only and semi-supervised training

US12354595B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Mar 18, 2023
Grant date: Jul 8, 2025
Priority date
Expiry date: Jan 17, 2044

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L15/16
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method of text-only and semi-supervised training for deliberation includes receiving training data including unspoken textual utterances that are each not paired with any corresponding spoken utterance of non-synthetic speech, and training a deliberation model that includes a text encoder and a deliberation decoder on the unspoken textual utterances. The method also includes receiving, at the trained deliberation model, first-pass hypotheses and non-causal acoustic embeddings. The first-pass hypotheses are generated by a recurrent neural network-transducer (RNN-T) decoder for the non-causal acoustic embeddings encoded by a non-causal encoder. The method also includes encoding, using the text encoder, the first-pass hypotheses generated by the RNN-T decoder, and generating, using the deliberation decoder attending to both the first-pass hypotheses and the non-causal acoustic embeddings, second-pass hypotheses.
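
The second-pass data flow in the abstract can be illustrated with a toy sketch. This is not the patented implementation: every function name, dimension, and random weight below is hypothetical, the text encoder is reduced to an embedding lookup, and combining the two attention contexts by concatenation is an assumption. It shows only the shape of the computation, in which a decoder step attends to both the encoded first-pass hypothesis and the acoustic embeddings before scoring output tokens.

```python
import math
import random

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(query, memory):
    """Dot-product attention: a weighted average of the memory vectors."""
    weights = softmax([sum(q * k for q, k in zip(query, vec)) for vec in memory])
    dim = len(memory[0])
    return [sum(w * vec[i] for w, vec in zip(weights, memory)) for i in range(dim)]

def random_matrix(rows, cols, seed):
    """Hypothetical stand-in for trained weights or encoder outputs."""
    rng = random.Random(seed)
    return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

def text_encoder(token_ids, table):
    """Stand-in text encoder: one embedding vector per first-pass token."""
    return [table[t] for t in token_ids]

def deliberation_step(query, hyp_encodings, acoustic_embeddings, output_proj):
    """One decoder step: attend to both sources, concatenate, project to vocab."""
    ctx_text = attend(query, hyp_encodings)          # context from first-pass text
    ctx_audio = attend(query, acoustic_embeddings)   # context from acoustic frames
    context = ctx_text + ctx_audio                   # list concatenation (assumed)
    logits = [sum(w * c for w, c in zip(row, context)) for row in output_proj]
    return softmax(logits)

# Toy sizes, chosen only for illustration.
DIM, VOCAB, FRAMES = 4, 6, 5
embedding_table = random_matrix(VOCAB, DIM, seed=0)
output_proj = random_matrix(VOCAB, 2 * DIM, seed=1)
acoustic = random_matrix(FRAMES, DIM, seed=2)  # stands in for non-causal encoder output
first_pass = [2, 5, 1]                          # stands in for an RNN-T hypothesis

hyp_enc = text_encoder(first_pass, embedding_table)
query = [0.1] * DIM                             # placeholder decoder state
probs = deliberation_step(query, hyp_enc, acoustic, output_proj)
second_pass_token = max(range(VOCAB), key=probs.__getitem__)
```

In this sketch, text-only training would correspond to updating `embedding_table` and `output_proj` from unspoken text alone. How the acoustic input is handled during that phase is not stated in the abstract, so it is left out here.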

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.