Patent · US Active

Generation of timed text using speech-to-text technology and applications thereof

US8645134B1 · kind B1 · utility

13Cited by
3References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 18, 2010
Grant dateFeb 4, 2014
Priority date
Expiry dateFeb 25, 2032

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L15/30
  • WIPO fieldAudio-visual technology
  • WIPO sectorElectrical engineering

Abstract

Embodiments relate to generation of timed text in web video. In an embodiment, a computer-implemented method generates timed text for online video. In the method, a request to play a timed text track of a video incorporated into a web video service is received from a client computing device. Prior to receipt of the request, audio of the video is processed to determine intermediate timed text data. The intermediate timed text data lacks a complete text transcription of the audio, but includes data to enable the complete text transcription to be generated when playing the video. In response to receipt of the request, a text transcription of the audio is determined using the intermediate data with an automated speech-to-text algorithm. Finally, the text transcription of the audio is sent to the client computing device for display along with the video.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.