Creating audio-centric, image-centric, and integrated audio-visual summaries
US6925455B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 25, 2001 |
| Grant date | Aug 2, 2005 |
| Priority date | — |
| Expiry date | Dec 27, 2023 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L15/26
- WIPO fieldAudio-visual technology
- WIPO sectorElectrical engineering
Abstract
Systems and methods create high quality audio-centric, image-centric, and integrated audio-visual summaries by seamlessly integrating image, audio, and text features extracted from input video. Integrated summarization may be employed when strict synchronization of audio and image content is not required. Video programming which requires synchronization of the audio content and the image content may be summarized using either an audio-centric or an image-centric approach. Both a machine learning-based approach and an alternative, heuristics-based approach are disclosed. Numerous probabilistic methods may be employed with the machine learning-based learning approach, such as naïve Bayes, decision tree, neural networks, and maximum entropy. To create an integrated audio-visual summary using the alternative, heuristics-based approach, a maximum-bipartite-matching approach is disclosed by way of example.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.