System and method for data anonymization using hierarchical data clustering and perturbation
US9135320B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 13, 2013 |
| Grant date | Sep 15, 2015 |
| Priority date | — |
| Expiry date | Nov 22, 2033 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/2228
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A system and method for data anonymization using hierarchical data clustering and perturbation is provided. The system includes a computer system and an anonymization program executed by the computer system. The system converts the data of a high-dimensional dataset to a normalized vector space and applies clustering and perturbation techniques to anonymize the data. The conversion results in each record of the dataset being converted into a normalized vector that can be compared to other vectors. The vectors are divided into disjointed, small-sized clusters using hierarchical clustering processes. Multi-level clustering can be performed using suitable algorithms at different clustering levels. The records within each cluster are then perturbed such that the statistical properties of the clusters remain unchanged.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.