Patent · US Active

Deduplication featuring variable-size duplicate data detection and fixed-size data segment sharing

US10656858B1 · kind B1 · utility

Cited by: 1
References: 7
Claims: 9
Family size: 0

Assignee

Inventors

Key dates

Filing date: Oct 7, 2016
Grant date: May 19, 2020
Priority date
Expiry date: Mar 23, 2037

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F3/0641
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A hybrid deduplication system operates to detect variable-sized deduplication matches, while performing the storage deduplication on fixed-size segments of data. The hybrid deduplication system calculates unique identifiers for variable-sized sections of data within a data stream being written to a deduplicated data store. The hybrid deduplication system then compares those newly-calculated identifiers to identifiers of variable-sized sections of data that have already been stored within the deduplicated data store. If a match is found, the hybrid deduplication system identifies the location of each of the fixed-size data segment(s), already stored in the deduplicated data store, that include the identified variable-sized section of data. Instead of writing the sections that match already-existing sections to the deduplicated data store, the hybrid deduplication system simply causes the creation of a reference to the identified storage locations, indicating that the data stream being written includes the data in these pre-existing storage locations.
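The flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not specify a boundary rule, an identifier function, or a storage layout, so everything named here is an assumption made for illustration. In particular, the toy byte-sum boundary rule in `chunk`, the use of SHA-256 as the unique identifier, the 8-byte `SEGMENT_SIZE`, and the `HybridStore` class with its append-only byte array are all hypothetical.

```python
import hashlib

SEGMENT_SIZE = 8  # assumed fixed-size storage unit (tiny, for illustration only)


def chunk(data: bytes, window: int = 4, divisor: int = 16,
          min_size: int = 8, max_size: int = 64) -> list[bytes]:
    """Split data into variable-size sections at content-defined boundaries."""
    chunks, start = [], 0
    for i in range(len(data)):
        size = i - start + 1
        if size < min_size:
            continue
        # Toy boundary rule (assumption): cut when the sum of the last
        # `window` bytes is a multiple of `divisor`, or at max_size.
        if sum(data[i - window + 1:i + 1]) % divisor == 0 or size >= max_size:
            chunks.append(data[start:i + 1])
            start = i + 1
    if start < len(data):
        chunks.append(data[start:])  # trailing section without a boundary
    return chunks


def segments_for(offset: int, length: int) -> list[int]:
    """Indices of the fixed-size segments containing bytes [offset, offset+length)."""
    first = offset // SEGMENT_SIZE
    last = (offset + length - 1) // SEGMENT_SIZE
    return list(range(first, last + 1))


class HybridStore:
    """Hypothetical store: detects duplicates per variable-size section,
    but addresses stored data through fixed-size segments."""

    def __init__(self):
        self.data = bytearray()  # backing store, viewed as fixed-size segments
        self.index = {}          # identifier -> (offset, length) of a stored section

    def write(self, stream: bytes) -> list[tuple[int, int]]:
        """Store a stream; return a recipe of (offset, length) extents."""
        recipe = []
        for section in chunk(stream):
            fp = hashlib.sha256(section).digest()
            if fp not in self.index:
                # New section: append it and remember where it lives.
                self.index[fp] = (len(self.data), len(section))
                self.data += section
            # Matched or new, the stream only records a reference.
            recipe.append(self.index[fp])
        return recipe

    def read(self, recipe: list[tuple[int, int]]) -> bytes:
        """Rebuild a stream from its recipe."""
        return b"".join(bytes(self.data[o:o + n]) for o, n in recipe)
```

In this sketch, writing the same stream twice leaves the backing store unchanged on the second write and returns an identical recipe. Calling `segments_for(offset, length)` on any extent in a recipe gives the fixed-size segments that a reference to that section would point at.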

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.