Patent · US Active

Optimization of fingerprint-based deduplication

US9047304B2 · kind B2 · utility

9Cited by
2References
24Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 28, 2011
Grant dateJun 2, 2015
Priority date
Expiry dateJun 18, 2032

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/951
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Described are embodiments of an invention for identifying chunk boundaries for optimization of fingerprint-based deduplication in a computing environment. Storage objects that are backed up in a computing environment are often compound storage objects which include many individual storage objects. The computing device of the computing environment breaks the storage objects into chunks of data by determining a hash value on a range of data. The computing device creates an artificial chunk boundary when the end of data of the storage object is reached. When an artificial chunk boundary is created for the end of data of a storage object, the computing device stores a pseudo fingerprint for the artificial chunk boundary. If a hash value matches a fingerprint or a pseudo fingerprint, then the computing device determines that the range of data corresponds to a chunk and the computing system defines the chunk boundaries.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.