Patent · US Active

Scalable deduplication system with small blocks

US9747055B2 · kind B2 · utility

Cited by: 0
References: 33
Claims: 15
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jun 8, 2015
Grant date: Aug 29, 2017
Priority date
Expiry date: Aug 13, 2035

Classification

  • Technology area (CPC H): Electricity
  • CPC primary: H03M7/3093
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Exemplary method, system, and computer program product embodiments are provided for scalable data deduplication working with small data chunks in a computing environment. In one embodiment, by way of example only, a signature is generated for each small data chunk based on a combination of a representation of characters used in selecting data to be deduplicated. A c-spectrum of the small data chunk is a sequence of representations of the different characters ordered by frequency of occurrence in the small data chunk, and an f-spectrum of the small data chunk is the corresponding sequence of frequencies of those characters in the small data chunk.
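
The two spectra named in the abstract can be illustrated with a minimal sketch. It is not the patented method: the function name `spectra`, the use of raw byte values as the "representation" of characters, the descending sort order, and the tie-break by byte value are all assumptions made here for illustration, since the abstract fixes none of them.

```python
from collections import Counter


def spectra(chunk: bytes):
    """Return (c_spectrum, f_spectrum) for a small data chunk.

    Hypothetical helper: the abstract only says characters are
    "ordered by a frequency of occurrence". Descending order and the
    tie-break on byte value are assumptions of this sketch.
    """
    counts = Counter(chunk)  # byte value -> number of occurrences
    # Most frequent first; equal frequencies are ordered by byte value
    # so that the result is deterministic.
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    c_spectrum = [byte for byte, _ in ordered]   # the distinct characters
    f_spectrum = [freq for _, freq in ordered]   # their matching frequencies
    return c_spectrum, f_spectrum
```

For the chunk `b"abracadabra"` this sketch yields the c-spectrum `[97, 98, 114, 99, 100]` (the bytes for a, b, r, c, d) and the f-spectrum `[5, 2, 2, 1, 1]`. How the spectra are then combined into a signature is not specified in the abstract and is left out here.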

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.