Patent · US Active

System and method for end-to-end speech recognition with triggered attention

US11100920B2 · kind B2 · utility

1Cited by

3References

15Claims

0Family size

Assignee

Mitsubishi Electric Research Laboratories, Inc. · US

Inventors

Niko Moritz · Brookline, US
Takaaki Hori · Lexington, US
Jonathan Le Roux · Somerville, US

Key dates

Filing date	Mar 25, 2019
Grant date	Aug 24, 2021
Priority date	—
Expiry date	Aug 16, 2039

Classification

Technology area (CPC G)Physics
CPC primaryG10L25/30
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A speech recognition system includes an encoder to convert an input acoustic signal into a sequence of encoder states, an alignment decoder to identify locations of encoder states in the sequence of encoder states that encode transcription outputs, a partition module to partition the sequence of encoder states into a set of partitions based on the locations of the identified encoder states, and an attention-based decoder to determine the transcription outputs for each partition of encoder states submitted to the attention-based decoder as an input. Upon receiving the acoustic signal, the system uses the encoder to produce the sequence of encoder states, partitions the sequence of encoder states into the set of partitions based on the locations of the encoder states identified by the alignment decoder, and submits the set of partitions sequentially into the attention-based decoder to produce a transcription output for each of the submitted partitions.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.