Patent · US Active

Speeding deduplication using a most wanted digest cache

US11093454B2 · kind B2 · utility

Cited by: 2
References: 6
Claims: 17
Family size: 0

Assignee

Inventors

Key dates

Filing date: Oct 31, 2017
Grant date: Aug 17, 2021
Priority date
Expiry date: Apr 11, 2040

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F16/137
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Embodiments are directed to techniques for performing deduplication. A method includes (a) obtaining a digest of a data block logically positioned within a filesystem, the digest providing a hash value of the data of the data block, (b) searching a Most Wanted Digest Cache (MWDC) within system memory for the digest, (c) locating an entry in the MWDC using the digest, wherein this locating indicates that the data block has the same data as another data block located elsewhere within the filesystem, the other data block having been previously persistently stored, the entry having been added to the MWDC in response to the other data block having been deduplicated a plurality of times, (d) locating a mapping structure referenced by the entry located from the MWDC, the mapping structure providing metadata about the other data block, and (e) deduplicating the data block and the other data block with reference to the located mapping structure.
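
Steps (a) through (e) can be illustrated with a minimal sketch. It is not the patented implementation: the names DedupStore, MappingStructure, full_index and promote_after are hypothetical, SHA-256 is an assumed choice of digest, a promotion threshold of 2 stands in for "a plurality", and a plain dictionary stands in for whatever slower, complete digest index a real system would consult on an MWDC miss.

```python
import hashlib
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass
class MappingStructure:
    """Hypothetical metadata about a persistently stored block."""
    block_address: int
    ref_count: int = 1    # logical blocks that share this stored block
    dedup_count: int = 0  # times this block has been a deduplication target


class DedupStore:
    """Toy model: an in-memory MWDC in front of a complete digest index."""

    def __init__(self, promote_after: int = 2):
        self.promote_after = promote_after  # assumed threshold for "a plurality"
        self.mwdc: Dict[bytes, MappingStructure] = {}        # hot digests only
        self.full_index: Dict[bytes, MappingStructure] = {}  # assumed fallback index
        self.next_address = 0

    def write_block(self, data: bytes) -> Tuple[MappingStructure, str]:
        # (a) Obtain the digest of the incoming block.
        digest = hashlib.sha256(data).digest()

        # (b), (c) Search the MWDC first; a hit means identical data is stored.
        mapping = self.mwdc.get(digest)
        source = "mwdc"
        if mapping is None:
            # Assumption: on a miss, fall back to the complete index.
            mapping = self.full_index.get(digest)
            source = "index"

        if mapping is None:
            # Unique data: store it and record its digest.
            mapping = MappingStructure(block_address=self.next_address)
            self.next_address += 1
            self.full_index[digest] = mapping
            return mapping, "new"

        # (d), (e) Deduplicate by sharing the block the mapping structure describes.
        mapping.ref_count += 1
        mapping.dedup_count += 1

        # Promote the digest once the block has been deduplicated often enough.
        if mapping.dedup_count >= self.promote_after:
            self.mwdc[digest] = mapping
        return mapping, source
```

With promote_after=2, writing the same block four times yields the outcomes "new", "index", "index", "mwdc": the digest enters the MWDC on the second deduplication, so the fourth write is resolved from system memory without touching the fallback index.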

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.