Patent · US Active

Multi-task automatic speech recognition system

US12079587B1 · kind B1 · utility

0Cited by

10References

18Claims

0Family size

Assignee

OpenAI OpCo, LLC · US

Inventors

Alec Radford · San Francisco, US
Jong Wook Kim · Seoul, KR
Tao Xu · Hangzhou City, CN
Greg D. Brockman · San Francisco, US
Christine McLeavey-Payne · San Francisco, US
Ilya Sutskever · San Francisco, US

Key dates

Filing date	Apr 18, 2023
Grant date	Sep 3, 2024
Priority date	—
Expiry date	Apr 18, 2043

Classification

Technology area (CPC G)Physics
CPC primaryG06F40/58
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Disclosed herein are methods, systems, and computer-readable media for generating an output transcript from an input audio segment using a multi-task transformer model. In some embodiments, the transformer model can be trained to transcribe or translate audio data in multiple languages using labeled audio data. The labeled audio data can include first audio segments associated with first same-language transcripts of the first audio segments and second audio segments associated with second different-language transcripts of the second audio segments. In some embodiments, a vocabulary of the model can include special purpose and time stamp tokens. The special purpose tokens can specify tasks for the model to perform.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.