Model training system for custom speech-to-text models
US11551695B1 · kind B1 · utility
Assignee
Inventors
- Vivek Govindan
- Varun Sembium Varadarajan
- Christian Egon Berkhoff Dossow
- Himalay Mohanlal Joriwal
- Sai Madhuri Bhavirisetty
- Abhinav Kumar
- Orestis Lykouropoulos
- Akshay Nalwaya
- Rahul Gupta
- Sravan Babu Bodapati
- Liangwei Guo
- Julian E. S. Salazar
- Yibin Wang
- K P N V D S Siva Rama
- Calvin Xuan Li
- Mohit Gupta
- Asem Rustum
- Katrin Kirchhoff
- Pu Paul Zhao
Key dates
| Filing date | May 13, 2020 |
| Grant date | Jan 10, 2023 |
| Priority date | — |
| Expiry date | Oct 9, 2040 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2015/0638
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A transcription service may receive a request from a developer to build a custom speech-to-text model for a specific domain of speech. The custom speech-to-text model for the specific domain may replace a general speech-to-text model or add to a set of one or more speech-to-text models available for transcribing speech. The transcription service may receive a training data and instructions representing tasks. The transcription service may determine respective schedules for executing the instructions based at least in part on dependencies between the tasks. The transcription service may execute the instructions according to the respective schedules to train a speech-to-text model for a specific domain using the training data set. The transcription service may deploy the trained speech-to-text model as part of a network-accessible service for an end user to convert audio in the specific domain into texts.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.