Patent · US Active

Systems, computer-implemented methods, and tangible computer-readable storage media for transcription alignment

US9495964B2 · kind B2 · utility

18Cited by
4References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 16, 2016
Grant dateNov 15, 2016
Priority date
Expiry dateMar 16, 2036

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04M2203/305
  • WIPO fieldAudio-visual technology
  • WIPO sectorElectrical engineering

Abstract

Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for captioning a media presentation. The method includes receiving automatic speech recognition (ASR) output from a media presentation and a transcription of the media presentation. The method includes selecting via a processor a pair of anchor words in the media presentation based on the ASR output and transcription and generating captions by aligning the transcription with the ASR output between the selected pair of anchor words. The transcription can be human-generated. Selecting pairs of anchor words can be based on a similarity threshold between the ASR output and the transcription. In one variation, commonly used words on a stop list are ineligible as anchor words. The method includes outputting the media presentation with the generated captions. The presentation can be a recording of a live event.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.