Patent · US Expired

System and method for efficiently generating cluster groupings in a multi-dimensional concept space

US6778995B1 · kind B1 · utility

110Cited by
12References
32Claims
0Family size

Assignee

Inventor

Key dates

Filing dateAug 31, 2001
Grant dateAug 17, 2004
Priority date
Expiry dateFeb 18, 2022

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99943
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system and method for efficiently generating cluster groupings in a multi-dimensional concept space is described. A plurality of terms are extracted from each document in a collection of stored unstructured documents. A concept space is built over the document collection. Terms substantially correlated between a plurality of documents within the document collection are identified. Each correlated term is expressed as a vector mapped along an angle &thgr; originating from a common axis in the concept space. A difference between the angle &thgr; for each document and an angle &sgr; for each cluster within the concept space is determined. Each such cluster is populated with those documents having such difference between the angle &thgr; for each such document and the angle &sgr; for each such cluster falling within a predetermined variance. A new cluster is created within the concept space those documents having such difference between the angle &thgr; for each such document and the angle &sgr; for each such cluster falling outside the predetermined variance.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.