Patent · US Active

Selection of hash key sizes for data deduplication

US11232075B2 · kind B2 · utility

Cited by: 0
References: 2
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Oct 25, 2018
Grant date: Jan 25, 2022
Priority date
Expiry date: May 16, 2040

Classification

  • Technology area (CPC H): Electricity
  • CPC primary: H03M7/6035
  • WIPO field: Basic communication processes
  • WIPO sector: Electrical engineering

Abstract

Techniques for data processing may include: receiving a data chunk; determining a metric value denoting a degree of compressibility of the data chunk; selecting, in accordance with the metric value denoting the compressibility of the data chunk, a first size of a plurality of sizes, wherein each of the plurality of sizes denotes a different size of an amount of storage used for storing a value of said each size; and performing the data deduplication processing for the data chunk, wherein the data deduplication processing includes using a first hash value for the data chunk to determine whether the data chunk is a duplicate of another data chunk of a hash table, wherein the first hash value is stored in a storage location of the first size.
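
The flow described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the patented implementation. The abstract does not name a compressibility metric, the candidate sizes, the thresholds, the hash function, or which direction the mapping runs, so everything here is an assumption: zlib's compression ratio stands in for the metric, the sizes are 8, 16 and 32 bytes, more compressible chunks get the shorter hash, and BLAKE2b is used because Python's hashlib lets the digest length be chosen per call. The names `compressibility`, `select_hash_size` and `DedupStore` are hypothetical.

```python
import hashlib
import zlib

# Hypothetical candidate hash sizes in bytes; the abstract only says "a plurality of sizes".
HASH_SIZES = (8, 16, 32)


def compressibility(chunk: bytes) -> float:
    """Return a metric in [0, 1]; higher means the chunk compresses better.

    Assumption: the zlib compression ratio is used as a stand-in metric.
    """
    if not chunk:
        return 0.0
    ratio = len(zlib.compress(chunk)) / len(chunk)
    return max(0.0, 1.0 - ratio)


def select_hash_size(metric: float) -> int:
    """Pick one of HASH_SIZES from the metric.

    Assumption: the thresholds and the direction of the mapping are made up
    for illustration; the abstract specifies neither.
    """
    if metric >= 0.5:
        return HASH_SIZES[0]
    if metric >= 0.1:
        return HASH_SIZES[1]
    return HASH_SIZES[2]


class DedupStore:
    """Toy in-memory hash table keyed by variable-length digests."""

    def __init__(self) -> None:
        # digest -> list of distinct chunks that share that digest
        self.table: dict[bytes, list[bytes]] = {}

    def add(self, chunk: bytes) -> bool:
        """Store the chunk; return True if it duplicates a stored chunk."""
        size = select_hash_size(compressibility(chunk))
        key = hashlib.blake2b(chunk, digest_size=size).digest()
        bucket = self.table.setdefault(key, [])
        # Compare the bytes as well, since a short hash can collide.
        if chunk in bucket:
            return True
        bucket.append(chunk)
        return False
```

In this sketch a chunk of 4096 zero bytes compresses almost completely and is keyed by an 8-byte digest, while data that zlib cannot shrink is keyed by a 32-byte digest. The byte-for-byte comparison inside `add` is a design choice of the sketch, not something the abstract states.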

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.