Patent · US Active

System and method for end-to-end speech recognition with triggered attention

US11100920B2 · kind B2 · utility

1Cited by
3References
15Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 25, 2019
Grant dateAug 24, 2021
Priority date
Expiry dateAug 16, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L25/30
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A speech recognition system includes an encoder to convert an input acoustic signal into a sequence of encoder states, an alignment decoder to identify locations of encoder states in the sequence of encoder states that encode transcription outputs, a partition module to partition the sequence of encoder states into a set of partitions based on the locations of the identified encoder states, and an attention-based decoder to determine the transcription outputs for each partition of encoder states submitted to the attention-based decoder as an input. Upon receiving the acoustic signal, the system uses the encoder to produce the sequence of encoder states, partitions the sequence of encoder states into the set of partitions based on the locations of the encoder states identified by the alignment decoder, and submits the set of partitions sequentially into the attention-based decoder to produce a transcription output for each of the submitted partitions.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.