Patent · US Active

Method for reducing redundancy between two or more datasets

US8527482B2 · kind B2 · utility

3Cited by
5References
18Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 8, 2009
Grant dateSep 3, 2013
Priority date
Expiry dateJul 27, 2030

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/2379
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method for reducing redundancy between two or more datasets of potentially very large size. The method improves upon current technology by oversubscribing the data structure that represents a digest of data blocks and using positional information about matching data so that very large datasets can be analyzed and the redundancies removed by, having found a match on digest, expands the match in both directions in order to detect and eliminate large runs of data by replace duplicate runs with references to common data. The method is particularly useful for capturing the states of images of a hard disk. The method permits several files to have their redundancy removed and the files to later be reconstituted. The method is appropriate for use on a WORM device. The method can also make use of L2 cache to improve performance.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.