Patent · US Expired

Unsupervised identification of nonlinear data cluster in multidimensional data

US6226408A · kind A · utility

91Cited by
11References
13Claims
0Family size

Assignee

Inventor

Key dates

Filing dateJan 29, 1999
Grant dateMay 1, 2001
Priority date
Expiry dateJan 29, 2019

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F18/23213
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system, method, and software product provide for unsupervised identification of complex, nonlinear subspaces in high dimensional data. The system includes a vector quantization module, a weighted topology representing graph module, and an encoding module. The vector quantization module takes vector data inputs and extracts a group of inputs about a number of cluster centers, using a globally optimized clustering process. The weighted topology representing graph module creates a weighted graph of the vector space, using the cluster centers as nodes, weighting edges between nodes as a function of the density of the vectors between the linked nodes. The encoding module uses the weighted graph to recode the input vectors based on their proximity to the cluster centers and the connectedness of the graph. The recoded vectors are reinput into the vector quantization module, and the process repeated until termination, for example at a limited number of cluster centers. Upon termination, the clusters thus identified may be highly nonlinear in the original data space.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.