System and method for partitioning backup data streams in a deduplication based storage system
US8983952B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Event | Date |
| --- | --- |
| Filing date | Jul 29, 2010 |
| Grant date | Mar 17, 2015 |
| Priority date | — |
| Expiry date | Feb 6, 2031 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F11/1453
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A system and method for partitioning a data stream into a plurality of segments of varying sizes. A data stream manager partitions a received data stream into segments which are then conveyed to a deduplication engine for processing. The data stream received by the data stream manager includes metadata corresponding to the data stream. Based upon the metadata, which may include an indication as to a type of data included in the data stream, the data stream is partitioned into segments for further processing. A size of a segment used for partitioning given data is based at least in part on a type of data being partitioned. The variable segment sizes may be chosen to balance between maximizing the deduplication ratio and minimizing both the segment count and the size of the fingerprint index.
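The partitioning described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the names `StreamMetadata`, `partition_stream`, `deduplicate`, and `SEGMENT_SIZE_BY_TYPE`, the specific segment sizes, the fixed-offset splitting, and the use of SHA-256 fingerprints are all assumptions made for illustration. It shows only the general idea that the segment size is selected from the data type indicated in the stream's metadata, and that larger segments mean fewer fingerprint index entries.

```python
import hashlib
from dataclasses import dataclass
from typing import Dict, Iterable, Iterator

# Hypothetical segment sizes in bytes per data type. The values are
# illustrative only and do not come from the patent.
SEGMENT_SIZE_BY_TYPE: Dict[str, int] = {
    "database": 8 * 1024,    # small segments: favors a higher deduplication ratio
    "document": 32 * 1024,
    "media": 128 * 1024,     # large segments: fewer segments and index entries
}
DEFAULT_SEGMENT_SIZE = 64 * 1024


@dataclass
class StreamMetadata:
    """Assumed shape of the metadata that accompanies a data stream."""
    data_type: str


def partition_stream(data: bytes, metadata: StreamMetadata) -> Iterator[bytes]:
    """Yield consecutive segments whose size depends on the data type."""
    size = SEGMENT_SIZE_BY_TYPE.get(metadata.data_type, DEFAULT_SEGMENT_SIZE)
    for offset in range(0, len(data), size):
        yield data[offset:offset + size]


def deduplicate(segments: Iterable[bytes], index: Dict[str, bytes]) -> int:
    """Add unseen segments to the fingerprint index; return how many were new."""
    new_segments = 0
    for segment in segments:
        # SHA-256 is an assumed fingerprint function, chosen for the sketch.
        fingerprint = hashlib.sha256(segment).hexdigest()
        if fingerprint not in index:
            index[fingerprint] = segment
            new_segments += 1
    return new_segments


if __name__ == "__main__":
    fingerprint_index: Dict[str, bytes] = {}
    stream = b"a" * 20000
    segments = list(partition_stream(stream, StreamMetadata("database")))
    stored = deduplicate(segments, fingerprint_index)
    print(len(segments), "segments,", stored, "stored")
```

In this sketch, a 20,000-byte stream of identical bytes tagged as `database` is split into three segments (8,192, 8,192, and 3,616 bytes), of which only two are stored because the first two share a fingerprint. Tagging the same stream as `media` would yield a single segment and a single index entry, which is the trade-off between deduplication ratio and index size that the abstract mentions.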
Source: USPTO / EPO open patent data (objective bibliographic data and citation counts).