Patent · US Active

Computer systems exhibiting improved computer speed and transcription accuracy of automatic speech transcription (AST) based on a multiple speech-to-text engines and methods of use thereof

US12219154B2 · kind B2 · utility

1Cited by

3References

17Claims

0Family size

Assignee

VOXSMART LIMITED · GB

Inventors

Tejas Shastry · Chicago, US
Matthew Goldey · Chicago, US
Svyat Vergun · Morton Grove, US

Key dates

Filing date	Nov 30, 2022
Grant date	Feb 4, 2025
Priority date	—
Expiry date	Jul 31, 2043

Classification

Technology area (CPC G)Physics
CPC primaryG10L25/51
WIPO fieldAudio-visual technology
WIPO sectorElectrical engineering

Abstract

In some embodiments, an exemplary inventive system for improving computer speed and accuracy of automatic speech transcription includes at least components of: a computer processor configured to perform: generating a recognition model specification for a plurality of distinct speech-to-text transcription engines; where each distinct speech-to-text transcription engine corresponds to a respective distinct speech recognition model; receiving at least one audio recording representing a speech of a person; segmenting the audio recording into a plurality of audio segments; determining a respective distinct speech-to-text transcription engine to transcribe a respective audio segment; receiving, from the respective transcription engine, a hypothesis for the respective audio segment; accepting the hypothesis to remove a need to submit the respective audio segment to another distinct speech-to-text transcription engine, resulting in the improved computer speed and the accuracy of automatic speech transcription and generating a transcript of the audio recording from respective accepted hypotheses for the plurality of audio segments.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.