Patent · US Active

System and method for partitioning backup data streams in a deduplication based storage system

US8983952B1 · kind B1 · utility

108Cited by
50References
17Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 29, 2010
Grant dateMar 17, 2015
Priority date
Expiry dateFeb 6, 2031

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F11/1453
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system and method for partitioning a data stream into a plurality of segments of varying sizes. A data stream manager partitions a received data stream into segments which are then conveyed to a deduplication engine for processing. The data stream received by the data stream manager includes metadata corresponding to the data stream. Based upon the metadata, which may include an indication as to a type of data included in the data stream, the data stream is partitioned into segments for further processing. A size of a segment used for partitioning given data is based at least in part on a type of data being partitioned. The variable segment sizes may be chosen to balance between maximizing the deduplication ratio and minimizing both the segment count and the size of the fingerprint index.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.