Speeding deduplication using a most wanted digest cache
US11093454B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 31, 2017 |
| Grant date | Aug 17, 2021 |
| Priority date | — |
| Expiry date | Apr 11, 2040 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/137
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Embodiments are directed to techniques for performing deduplication. A method includes (a) obtaining a digest of a data block logically-positioned within a filesystem, the digest providing a hash value of data of the data block, (b) searching a Most Wanted Digest Cache (MWDC) within system memory for the digest, (c) locating an entry in the MWDC using the digest, wherein this locating indicates that the data block has the same data as another data block located elsewhere within the filesystem, the other data block having been previously persistently-stored, the entry having been added to the MWDC in response to the other data block having been deduplicated at least a plurality number of times, (d) locating a mapping structure referenced by the entry located from the MWDC, the mapping structure providing metadata about the other data block, and (e) deduplicating the data block and the other data block with reference to the located mapping structure.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.