Patent · US Active

Method and system for distributed garbage collection of deduplicated datasets

US10235285B1 · kind B1 · utility

70Cited by
13References
21Claims
0Family size

Assignee

Inventor

Key dates

Filing dateMar 31, 2016
Grant dateMar 19, 2019
Priority date
Expiry dateApr 1, 2037

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F2212/7205
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Embodiments relating to garbage collection for a deduplicated and compressed storage device are described. One embodiment provides for a computer implemented method including creating a multiple sets of Bloom filters distributed across a set of multiple computing device nodes. One set of Bloom filters stores differing ranges of fingerprints for data stored on deduplicated storage containers, while a second set of Bloom filters includes a set of fingerprints correlated with files in a file system directory structure. A set of live fingerprints is determined for live fingerprints and storage segments associated with those fingerprints are copied to new storage containers.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.