Patent · US Active

Computer systems exhibiting improved computer speed and transcription accuracy of automatic speech transcription (AST) based on a multiple speech-to-text engines and methods of use thereof

US12219154B2 · kind B2 · utility

Cited by: 1
References: 3
Claims: 17
Family size: 0

Assignee

Inventors

Key dates

Filing date: Nov 30, 2022
Grant date: Feb 4, 2025
Priority date
Expiry date: Jul 31, 2043

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L25/51
  • WIPO field: Audio-visual technology
  • WIPO sector: Electrical engineering

Abstract

In some embodiments, an exemplary inventive system for improving computer speed and accuracy of automatic speech transcription includes at least components of: a computer processor configured to perform: generating a recognition model specification for a plurality of distinct speech-to-text transcription engines; where each distinct speech-to-text transcription engine corresponds to a respective distinct speech recognition model; receiving at least one audio recording representing a speech of a person; segmenting the audio recording into a plurality of audio segments; determining a respective distinct speech-to-text transcription engine to transcribe a respective audio segment; receiving, from the respective transcription engine, a hypothesis for the respective audio segment; accepting the hypothesis to remove a need to submit the respective audio segment to another distinct speech-to-text transcription engine, resulting in the improved computer speed and the accuracy of automatic speech transcription; and generating a transcript of the audio recording from respective accepted hypotheses for the plurality of audio segments.
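
The flow described in the abstract (segment the recording, pick an engine per segment, accept a hypothesis so the segment need not go to another engine, then assemble the transcript) might be sketched roughly as follows. This is an illustrative reading only, not the patented implementation: the `Hypothesis` type, the numeric `confidence` score, the `accept_threshold` value, the fixed-length segmentation, and the fallback order over the remaining engines are all assumptions that the abstract does not specify, and the two toy engines stand in for real speech-to-text services.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass
class Hypothesis:
    text: str
    confidence: float  # assumed score in [0, 1]; the abstract names no acceptance criterion


# Hypothetical engine interface: takes audio samples, returns one hypothesis.
Engine = Callable[[Sequence[float]], Hypothesis]


def segment_audio(samples: Sequence[float], segment_len: int) -> List[Sequence[float]]:
    """Split the recording into fixed-length segments (an assumed segmentation rule)."""
    return [samples[i:i + segment_len] for i in range(0, len(samples), segment_len)]


def transcribe(
    samples: Sequence[float],
    engines: Dict[str, Engine],
    choose_engine: Callable[[Sequence[float]], str],
    segment_len: int,
    accept_threshold: float = 0.8,  # assumed value, for illustration only
) -> str:
    accepted: List[str] = []
    for segment in segment_audio(samples, segment_len):
        # Try the engine selected for this segment first, then the others.
        first = choose_engine(segment)
        order = [first] + [name for name in engines if name != first]
        best = None
        for name in order:
            hyp = engines[name](segment)
            if best is None or hyp.confidence > best.confidence:
                best = hyp
            if hyp.confidence >= accept_threshold:
                break  # accepted: the segment is not submitted to another engine
        accepted.append(best.text)
    # The transcript is built from the accepted hypotheses, in segment order.
    return " ".join(accepted)


# Toy engines that count how often they are called.
calls = {"fast": 0, "careful": 0}


def fast_engine(segment: Sequence[float]) -> Hypothesis:
    calls["fast"] += 1
    loud = max(segment) > 0.5
    return Hypothesis("hello" if loud else "??", 0.9 if loud else 0.3)


def careful_engine(segment: Sequence[float]) -> Hypothesis:
    calls["careful"] += 1
    return Hypothesis("world", 0.95)


engines = {"fast": fast_engine, "careful": careful_engine}
transcript = transcribe([0.9, 0.8, 0.1, 0.2], engines, lambda seg: "fast", segment_len=2)
print(transcript)
print(calls)
```

In this toy run the first segment is accepted from the first engine and never reaches the second one, which is the source of the speed-up the abstract claims; only the low-confidence second segment is sent to another engine.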

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.