Patent · US Active

System and method for efficiently generating cluster groupings in a multi-dimensional concept space

US8402026B2 · kind B2 · utility

25Cited by
182References
25Claims
0Family size

Assignee

Inventor

Key dates

Filing dateAug 3, 2004
Grant dateMar 19, 2013
Priority date
Expiry dateNov 15, 2028

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99943
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system and method for efficiently generating cluster groupings in a multi-dimensional concept space is described. A plurality of terms is extracted from each document in a collection of stored unstructured documents. A concept space is built over the document collection. Terms substantially correlated between a plurality of documents within the document collection are identified. Each correlated term is expressed as a vector mapped along an angle θ originating from a common axis in the concept space. A difference between the angle θ for each document and an angle σ for each cluster within the concept space is determined. Each such cluster is populated with those documents having such difference between the angle θ for each such document and the angle σ for each such cluster falling within a predetermined variance. A new cluster is created within the concept space those documents having such difference between the angle θ for each such document and the angle σ for each such cluster falling outside the predetermined variance.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.