Patent · US Active

Apparatus and methods for anonymizing a data set

US8943079B2 · kind B2 · utility

5Cited by
3References
17Claims
0Family size

Assignee

Inventors

Key dates

Filing dateFeb 1, 2012
Grant dateJan 27, 2015
Priority date
Expiry dateMar 10, 2033

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F21/6254
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods and systems are disclosed for anonymizing a dataset that correlates a set of entities with respective attributes. The method comprises determine clusters of similar entities. Determining the clusters comprises (1) partitioning the entities into a first group with similar attributes to one another and a complement group of entities with similar attributes to one another and (2) recursively repeating the partitioning on the groups until every group meets one or more criteria. The partitioning a group comprises choosing a reference entity from the group, determining a symmetric set of attributes based on the reference entity attributes and on an average of the group's attributes, and assigning each entity to the first or second group depending on whether its attributes are more similar to those of the reference user or to those of the symmetric set.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.