Patent · US Active

Method and system for identifying clusters within a collection of data entities

US9256681B2 · kind B2 · utility

1Cited by
3References
16Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 28, 2012
Grant dateFeb 9, 2016
Priority date
Expiry dateNov 28, 2032

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/9535
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Embodiments of a method and system for identifying clusters in collections of data entities are generally described herein. In some embodiments, the method includes defining a metric space over the data entities. A distance function of the metric space may satisfy the triangle inequality. The method may include determining, based on the distance function of the metric space, a value for a number of clusters that minimizes a number of data bits used to define a model of the collection of the data entities. The model may thereby describe the collection of data entities using a minimum description length (MDL). The method may include assigning data entities of the collection of data entities to the clusters. The number of clusters to which the data entities are assigned may correspond to the determined value.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.