Patent · US Active

Data deduplication

US8799238B2 · kind B2 · utility

14Cited by
0References
15Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 8, 2010
Grant dateAug 5, 2014
Priority date
Expiry dateNov 3, 2030

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F11/1469
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method for data deduplication includes receiving a set of hashes derived from a data chunk of a set of input data chunks 310. The method includes sampling the set of hashes 320, using an index indentifying data chunk containers that hold data chunks having a hash in the set of sampled hashes 330, and loading indexes for at least one of the identified data chunk containers 340. The method includes determining which of the hashes correspond to data chunks stored in data chunk containers corresponding to the loaded indexes 350 and deciding which of the set of input data chunks should be stored based at least in part on the determination.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.