Patent · US Expired

System and method for synchronized text display and audio playback

US7346506B2 · kind B2 · utility

63Cited by
8References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 8, 2003
Grant dateMar 18, 2008
Priority date
Expiry dateFeb 14, 2026

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2015/225
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

An audio processing system and method for providing synchronized display of recognized text from an original audio file and playback of the original audio file. The system includes a speech recognition module, a silence insertion module, and a silence detection module. The speech recognition module generates text and audio pieces. The silence insertion module, aggregates the audio pieces into an aggregated audio file. The silence detection module converts the original audio file and the aggregated audio file into silence detected versions. Silent and non-silent blocks are identified using a threshold volume. The silence insertion module compares the silence detected original and aggregated audio files, determines the differences in position of non-silence elements and inserts silence within the audio pieces accordingly. The characteristics of the silence inserted audio pieces are used to synchronize the display of recognized text from an original audio file and playback of original audio file.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.