Generation of timed text using speech-to-text technology and applications thereof
US8645134B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 18, 2010 |
| Grant date | Feb 4, 2014 |
| Priority date | — |
| Expiry date | Feb 25, 2032 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L15/30
- WIPO field: Audio-visual technology
- WIPO sector: Electrical engineering
Abstract
Embodiments relate to generation of timed text in web video. In an embodiment, a computer-implemented method generates timed text for online video. In the method, a request to play a timed text track of a video incorporated into a web video service is received from a client computing device. Prior to receipt of the request, audio of the video is processed to determine intermediate timed text data. The intermediate timed text data lacks a complete text transcription of the audio, but includes data to enable the complete text transcription to be generated when playing the video. In response to receipt of the request, a text transcription of the audio is determined using the intermediate data with an automated speech-to-text algorithm. Finally, the text transcription of the audio is sent to the client computing device for display along with the video.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.