Scalable deduplication system with small blocks
US9747055B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 8, 2015 |
| Grant date | Aug 29, 2017 |
| Priority date | — |
| Expiry date | Aug 13, 2035 |
Classification
- Technology area (CPC H)Electricity
- CPC primaryH03M7/3093
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Exemplary method, system, and computer program product embodiments for scalable data deduplication working with small data chunk in a computing environment are provided. In one embodiment, by way of example only, for each small data chunk, a signature is generated based on a combination of a representation of characters used in selecting data to be deduplicated. A c-spectrum of the small data chunk being a sequence of representations of different characters ordered by a frequency of occurrence in the small data chunk, and an f-spectrum of the small data chunk being a corresponding sequence of frequencies of the different characters in the small data chunk.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.