Patent · US Active

Method and apparatus for content-aware and adaptive deduplication

US8412848B2 · kind B2 · utility

86Cited by
7References
79Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 29, 2009
Grant dateApr 2, 2013
Priority date
Expiry dateSep 10, 2030

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04L67/1097
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method, a system, an apparatus, and a computer readable medium for transmission of data across a network are disclosed. The method includes receiving a data stream, analyzing the received data stream to determine a starting location and an ending location of each zone within the received data stream, based on the starting and ending locations, generating a zone stamp identifying the zone, the zone stamp includes a sequence of contiguous characters representing at least a portion of data in the zone, wherein the order of characters in the zone stamp corresponds to the order of data in the zone, comparing the zone stamp with another zone stamp of another zone in any data stream received, determining whether the zone is substantially similar to another zone by detecting that the zone stamp is substantially similar to another zone stamp, delta-compressing zones within any data stream received that have been determined to have substantially similar zone stamps, thereby deduplicating zones having substantially similar zone stamps within any data stream received, and transmitting the deduplicated zones across the network from one storage location to another storage location.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.