Patent · US Expired

Generating and matching hashes of multimedia content

US7549052B2 · kind B2 · utility

80Cited by
88References
28Claims
0Family size

Assignees

Inventors

Key dates

Filing dateFeb 11, 2002
Grant dateJun 16, 2009
Priority date
Expiry dateJul 6, 2024

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04H2201/90
  • WIPO fieldAudio-visual technology
  • WIPO sectorElectrical engineering

Abstract

Hashes are short summaries or signatures of data files which can be used to identify the file. Hashing multimedia content (audio, video, images) is difficult because the hash of original content and processed (e.g. compressed) content may differ significantly.The disclosed method generates robust hashes for multimedia content, for example, audio clips. The audio clip is divided (12) into successive (preferably overlapping) frames. For each frame, the frequency spectrum is divided (15) into bands. A robust property of each band (e.g. energy) is computed (16) and represented (17) by a respective hash bit. An audio clip is thus represented by a concatenation of binary hash words, one for each frame. To identify a possibly compressed audio signal, a block of hash words derived therefrom is matched by a computer (20) with a large database (21). Such matching strategies are also disclosed. In an advantageous embodiment, the extraction process also provides information (19) as to which of the hash bits are the least reliable. Flipping these bits considerably improves the speed and performance of the matching process.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.