Patent · US Expired

Indexing multimedia communications

US6377995B2 · kind B2 · utility

125Cited by
9References
18Claims
0Family size

Assignee

Inventors

Key dates

Filing dateFeb 19, 1998
Grant dateApr 23, 2002
Priority date
Expiry dateFeb 19, 2018

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04N7/152
  • WIPO fieldAudio-visual technology
  • WIPO sectorElectrical engineering

Abstract

A network based platform uses face recognition, speech recognition, background change detection and key scene events to index multimedia communications. Before the multimedia communication begins, active participants register their speech and face models with a server. The process consists of creating a speech sample, capturing a sample image of the participant and storing the data in a database. The server provides an indexing function for the multimedia communication. During the multimedia communication, metadata including time stamping is retained along with the multimedia content. The time stamping information is used for synchronizing the multimedia elements. The multimedia communication is then processed through the server to identify the multimedia communication participants based on speaker and face recognition models. This allows the server to create an index table that becomes an index of the multimedia communication. In addition, through scene change detection and background recognition, certain backgrounds and key scene information can be used for indexing. Therefore, through this indexing apparatus and method, a specific participant can be recognized as speaking and th…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.