Patent · US Active

Systems and methods for automatic speech recognition based on graphics processing units

US11562734B2 · kind B2 · utility

1Cited by

1References

20Claims

0Family size

Assignee

KWAI INC. · US

Inventors

Yongxiong Ren · San Jose, US
Yang Liu · Nanhu, CN
Heng Liu · Beijing, CN
Lingzhi Liu · San Jose, US
Jie Li · Lo Wu, CN
Kaituo Xu · Beijing, CN
Xiaorui Wang · Beijing, CN

Key dates

Filing date	Jan 4, 2021
Grant date	Jan 24, 2023
Priority date	—
Expiry date	Feb 4, 2041

Classification

Technology area (CPC G)Physics
CPC primaryG10L25/30
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

The present disclosure relates to an automatic speech recognition system and a method thereof. The system includes a conformer encoder and a pair of ping-pong buffers. The encoder includes a plurality of encoder layers sequentially executed by one or more graphic processing units. At least one encoder layer includes a first feed forward module, a multi-head self-attention module, a convolution module, and a second feed forward module. The convolution module and the multi-head self-attention module are sandwiched between the first feedforward module and the second feed forward module. The four modules respectively include a plurality of encoder sublayers fused into one or more encoder kernels. The one or more encoder kernels respectively read from one of the pair of ping-pong buffers and write into the other of the pair of ping-pong buffers.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.