Unsupervised identification of nonlinear data cluster in multidimensional data
US6226408A · kind A · utility
Assignee
Inventor
Key dates
| Filing date | Jan 29, 1999 |
| Grant date | May 1, 2001 |
| Priority date | — |
| Expiry date | Jan 29, 2019 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F18/23213
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A system, method, and software product provide for unsupervised identification of complex, nonlinear subspaces in high dimensional data. The system includes a vector quantization module, a weighted topology representing graph module, and an encoding module. The vector quantization module takes vector data inputs and extracts a group of inputs about a number of cluster centers, using a globally optimized clustering process. The weighted topology representing graph module creates a weighted graph of the vector space, using the cluster centers as nodes, weighting edges between nodes as a function of the density of the vectors between the linked nodes. The encoding module uses the weighted graph to recode the input vectors based on their proximity to the cluster centers and the connectedness of the graph. The recoded vectors are reinput into the vector quantization module, and the process repeated until termination, for example at a limited number of cluster centers. Upon termination, the clusters thus identified may be highly nonlinear in the original data space.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.