Patent · US Active

System and method for data anonymization using hierarchical data clustering and perturbation

US9135320B2 · kind B2 · utility

Cited by: 7
References: 0
Claims: 30
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jun 13, 2013
Grant date: Sep 15, 2015
Priority date
Expiry date: Nov 22, 2033

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F16/2228
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A system and method for data anonymization using hierarchical data clustering and perturbation are provided. The system includes a computer system and an anonymization program executed by the computer system. The system converts the data of a high-dimensional dataset to a normalized vector space and applies clustering and perturbation techniques to anonymize the data. The conversion turns each record of the dataset into a normalized vector that can be compared with other vectors. The vectors are divided into small, disjoint clusters using hierarchical clustering processes. Multi-level clustering can be performed using suitable algorithms at different clustering levels. The records within each cluster are then perturbed such that the statistical properties of the clusters remain unchanged.
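
The three stages named in the abstract (normalization, hierarchical clustering into small disjoint clusters, and statistics-preserving perturbation) can be illustrated with a minimal sketch. The abstract does not name specific algorithms, so everything concrete here is an assumption made for illustration: min-max scaling for normalization, a divisive median split along the widest dimension for the hierarchical clustering, and an independent per-attribute shuffle within each cluster for the perturbation. The function names `normalize`, `bisect_clusters`, and `perturb` are hypothetical and are not taken from the patent.

```python
import random


def normalize(records):
    """Min-max scale each numeric column to [0, 1] (assumed normalization)."""
    cols = list(zip(*records))
    lows = [min(c) for c in cols]
    # A constant column has zero span; use 1.0 to avoid division by zero.
    spans = [(max(c) - lo) or 1.0 for c, lo in zip(cols, lows)]
    return [[(v - lo) / s for v, lo, s in zip(row, lows, spans)]
            for row in records]


def bisect_clusters(vectors, indices, max_size):
    """Divisive hierarchical clustering (assumed): recursively split the
    record indices at the median of the widest dimension until every
    cluster holds at most max_size records."""
    if max_size < 1:
        raise ValueError("max_size must be at least 1")
    if len(indices) <= max_size:
        return [indices]

    def spread(d):
        vals = [vectors[i][d] for i in indices]
        return max(vals) - min(vals)

    widest = max(range(len(vectors[0])), key=spread)
    ordered = sorted(indices, key=lambda i: vectors[i][widest])
    mid = len(ordered) // 2
    return (bisect_clusters(vectors, ordered[:mid], max_size)
            + bisect_clusters(vectors, ordered[mid:], max_size))


def perturb(vectors, clusters, rng):
    """Shuffle each attribute independently within each cluster (assumed
    perturbation). Per-cluster, per-attribute values are only permuted,
    so each attribute's mean and variance within a cluster are unchanged;
    correlations between attributes are not preserved."""
    out = [list(v) for v in vectors]
    for members in clusters:
        for d in range(len(vectors[0])):
            values = [vectors[i][d] for i in members]
            rng.shuffle(values)
            for i, v in zip(members, values):
                out[i][d] = v
    return out


# Usage with made-up (age, income) records.
records = [[25, 50000], [31, 62000], [47, 58000], [52, 91000],
           [38, 73000], [29, 48000], [60, 99000], [44, 67000]]
vectors = normalize(records)
clusters = bisect_clusters(vectors, list(range(len(vectors))), max_size=3)
anonymized = perturb(vectors, clusters, random.Random(0))
```

The shuffle keeps only per-attribute statistics within each cluster. A scheme that must also preserve relationships between attributes would need a different perturbation step, which the abstract leaves open.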

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.