Patent · US Active

Detection and deduplication of backup sets exhibiting poor locality

US9122639B2 · kind B2 · utility

6Cited by
44References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJan 25, 2011
Grant dateSep 1, 2015
Priority date
Expiry dateJul 3, 2032

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F2201/81
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Described are computer-based methods and apparatuses, including computer program products, for detection and deduplication of backup sets exhibiting poor locality. A first set of summaries of a first data set are determined, each summary of the first set of summaries being indicative of a data pattern in the first data set. A second set of summaries of a second data set are determined, each summary of the second set of summaries being indicative of a data pattern in the second data set. A set of comparison metrics are calculated, each comparison metric being based on a first subset of summaries from the first set of summaries and a second subset of summaries from the second set of summaries. A locality metric is calculated based on the set of comparison metrics indicative of whether the first data set and second data set exhibit poor locality.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.