System and method for synchronized text display and audio playback
US7346506B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 8, 2003 |
| Grant date | Mar 18, 2008 |
| Priority date | — |
| Expiry date | Feb 14, 2026 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L2015/225
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
An audio processing system and method for providing synchronized display of recognized text from an original audio file and playback of the original audio file. The system includes a speech recognition module, a silence insertion module, and a silence detection module. The speech recognition module generates text and audio pieces. The silence insertion module aggregates the audio pieces into an aggregated audio file. The silence detection module converts the original audio file and the aggregated audio file into silence-detected versions. Silent and non-silent blocks are identified using a threshold volume. The silence insertion module compares the silence-detected original and aggregated audio files, determines the differences in position of non-silence elements, and inserts silence within the audio pieces accordingly. The characteristics of the silence-inserted audio pieces are used to synchronize the display of recognized text from the original audio file with playback of the original audio file.
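The threshold-based silence detection and the position comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names (`detect_non_silence`, `run_starts`, `silence_to_insert`), the use of peak amplitude as the "volume" measure, the fixed block size, and the assumption that non-silent runs in the two files correspond one-to-one in order are all hypothetical choices made for illustration.

```python
from typing import List


def detect_non_silence(samples: List[float], threshold: float, block_size: int) -> List[bool]:
    """Mark each block True (non-silent) or False (silent).

    Assumption: a block is non-silent when its peak absolute amplitude
    reaches the threshold volume.
    """
    flags = []
    for start in range(0, len(samples), block_size):
        block = samples[start:start + block_size]
        peak = max(abs(s) for s in block)
        flags.append(peak >= threshold)
    return flags


def run_starts(flags: List[bool]) -> List[int]:
    """Return the block index at which each non-silent run begins."""
    starts = []
    previous = False
    for index, flag in enumerate(flags):
        if flag and not previous:
            starts.append(index)
        previous = flag
    return starts


def silence_to_insert(original_flags: List[bool], aggregated_flags: List[bool]) -> List[int]:
    """Blocks of silence to insert before each non-silent run of the
    aggregated audio so that the run lines up with the original.

    Assumption: the i-th non-silent run in the aggregated audio matches
    the i-th non-silent run in the original audio.
    """
    padding = []
    inserted = 0  # silence already added ahead of the current run
    pairs = zip(run_starts(original_flags), run_starts(aggregated_flags))
    for orig_start, agg_start in pairs:
        gap = max(0, orig_start - (agg_start + inserted))
        padding.append(gap)
        inserted += gap
    return padding
```

For example, with a block size of 1 and a threshold of 0.1, an original signal `[0, 0, 0.5, 0.6, 0, 0, 0, 0.7, 0.8, 0]` and an aggregated signal `[0.5, 0.6, 0, 0.7, 0.8]` give a padding list of `[2, 2]`: two blocks of silence before each non-silent run bring the aggregated audio back into step with the original. Timing derived this way could then drive when each piece of recognized text is displayed.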
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.