Patent · US Active

Generation of timed text using speech-to-text technology and applications thereof

US8645134B1 · kind B1 · utility

13Cited by

3References

20Claims

0Family size

Assignee

Google LLC · US

Inventors

Kenneth L. Harrenstien · Palo Alto, US
Toliver Jue · Tokyo, JP
Christopher Alberti · New York, US
Naomi D. Black-Bilodeau · Corona, US

Key dates

Filing date	Nov 18, 2010
Grant date	Feb 4, 2014
Priority date	—
Expiry date	Feb 25, 2032

Classification

Technology area (CPC G)Physics
CPC primaryG10L15/30
WIPO fieldAudio-visual technology
WIPO sectorElectrical engineering

Abstract

Embodiments relate to generation of timed text in web video. In an embodiment, a computer-implemented method generates timed text for online video. In the method, a request to play a timed text track of a video incorporated into a web video service is received from a client computing device. Prior to receipt of the request, audio of the video is processed to determine intermediate timed text data. The intermediate timed text data lacks a complete text transcription of the audio, but includes data to enable the complete text transcription to be generated when playing the video. In response to receipt of the request, a text transcription of the audio is determined using the intermediate data with an automated speech-to-text algorithm. Finally, the text transcription of the audio is sent to the client computing device for display along with the video.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.