Amalgamating multimedia transcripts for closed captioning from a plurality of text to speech conversions
US9332319B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 27, 2010 |
| Grant date | May 3, 2016 |
| Priority date | — |
| Expiry date | Feb 10, 2031 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L15/26
- WIPO field: Audio-visual technology
- WIPO sector: Electrical engineering
Abstract
Methods and systems for converting speech to text are disclosed. One method includes analyzing multimedia content to determine the presence of closed captioning data. The method includes, upon detecting closed captioning data, indexing the closed captioning data as associated with the multimedia content. The method also includes, upon failure to detect closed captioning data in the multimedia content, extracting audio data from multimedia content, the audio data including speech data, performing a plurality of speech to text conversions on the speech data to create a plurality of transcripts of the speech data, selecting text from one or more of the plurality of transcripts to form an amalgamated transcript, and indexing the amalgamated transcript as associated with the multimedia content.
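The control flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how text is selected from the competing transcripts, so this sketch assumes each speech-to-text engine returns time-aligned segments with a confidence score and keeps the highest-scoring segment at each position. The names `Segment`, `Multimedia`, `amalgamate`, and `index_content`, and the use of a plain dictionary as the index, are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Segment:
    text: str
    confidence: float  # assumed engine-reported score; not specified in the abstract


@dataclass
class Multimedia:
    content_id: str
    audio: bytes
    closed_captions: Optional[str] = None  # None means no captions were detected


# Hypothetical engine interface: audio in, list of segments out.
# All engines are assumed to return the same number of aligned segments.
Engine = Callable[[bytes], List[Segment]]


def amalgamate(transcripts: List[List[Segment]]) -> str:
    """Keep the highest-confidence segment at each aligned position."""
    best = [
        max(candidates, key=lambda seg: seg.confidence).text
        for candidates in zip(*transcripts)  # i-th segment of every transcript
    ]
    return " ".join(best)


def index_content(media: Multimedia, engines: List[Engine], index: Dict[str, str]) -> str:
    """Index existing captions if present, otherwise an amalgamated transcript."""
    if media.closed_captions is not None:
        index[media.content_id] = media.closed_captions
    else:
        # Run every engine on the same audio to get a plurality of transcripts.
        transcripts = [engine(media.audio) for engine in engines]
        index[media.content_id] = amalgamate(transcripts)
    return index[media.content_id]


# Two stand-in engines that disagree on both segments.
def engine_a(audio: bytes) -> List[Segment]:
    return [Segment("hello", 0.9), Segment("word", 0.4)]


def engine_b(audio: bytes) -> List[Segment]:
    return [Segment("hollow", 0.5), Segment("world", 0.8)]


index: Dict[str, str] = {}
result = index_content(Multimedia("clip-1", b""), [engine_a, engine_b], index)
print(result)  # hello world
```

Selecting by per-segment confidence is only one plausible selection rule; voting across engines or scoring against a language model would fit the same structure by replacing the body of `amalgamate`.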
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.