Patent · US Active

Streaming real-time automatic speech recognition service

US10777186B1 · kind B1 · utility

5Cited by
10References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 13, 2018
Grant dateSep 15, 2020
Priority date
Expiry dateMar 15, 2039

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04L63/10
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Techniques for streaming real-time automated speech recognition (ASR) are described. A user can stream audio data to a frontend service of the ASR service. The frontend service can establish a bi-directional connection to an audio decoder host to perform ASR on the data stream. The audio decoder host may include a streaming ASR engine which can analyze chunks of the audio data stream using an acoustic model to divide the audio data into words, and a language model to identify sentences made of the words spoken in the audio file. The acoustic model can be trained using short audio sentence data (e.g., on the order of 30 seconds to a few minutes), enabling the transcription service to accurately transcribe short chunks of audio data. The results are then punctuated and normalized. The resulting transcript is then streamed back to the user over the bi-directional connection.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.