Patent · US Expired

Parallel document clustering process

US5864855A · kind A · utility

Cited by: 221
References: 5
Claims: 1
Family size: 0

Assignee

Inventors

Key dates

Filing date: Feb 26, 1996
Grant date: Jan 26, 1999
Priority date
Expiry date: Feb 26, 2016

Classification

  • Technology area (CPC Y): Emerging Cross-Sectional Technologies
  • CPC primary: Y10S707/99936
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A computer information processing system uses parallel processors to organize and cluster a large number of documents into a large number of clusters for information analysis and retrieval. After the documents are translated into electronic digital documents, each document is converted into a vector based on a weighted list of the occurrences of the different words and terms that appear in the document. The document vectors are grouped into cluster vectors on different parallel processors according to their similarities. New document vectors are compared simultaneously with the existing cluster vectors on the different parallel processors.
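
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not specify the term weighting, the similarity measure, or the assignment rule, so raw term counts, cosine similarity, and a fixed similarity threshold are all assumptions made here. The function names (`to_vector`, `cosine`, `best_in_partition`, `assign`) and the `workers` and `threshold` parameters are hypothetical, and a thread pool merely stands in for the parallel processors.

```python
import math
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def to_vector(text):
    """Turn a document into a term-weight vector (raw counts: an assumed weighting)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse vectors (an assumed similarity measure)."""
    dot = sum(w * b.get(t, 0) for t, w in a.items())
    norm = math.sqrt(sum(w * w for w in a.values())) * math.sqrt(sum(w * w for w in b.values()))
    return dot / norm if norm else 0.0


def best_in_partition(doc_vec, partition):
    """Score one worker's share of the clusters; return (similarity, cluster_id)."""
    return max(((cosine(doc_vec, vec), cid) for cid, vec in partition), default=(0.0, None))


def assign(doc_vec, clusters, workers=2, threshold=0.3):
    """Compare a new document with all cluster vectors, split across workers."""
    items = list(clusters.items())
    # Each worker holds a disjoint slice of the cluster vectors.
    partitions = [items[i::workers] for i in range(workers)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(lambda p: best_in_partition(doc_vec, p), partitions))
    score, cid = max(results, key=lambda r: r[0])
    if cid is None or score < threshold:
        cid = len(clusters)        # no cluster is similar enough: open a new one
        clusters[cid] = Counter()
    clusters[cid].update(doc_vec)  # cluster vector = sum of its member vectors
    return cid
```

Starting from `clusters = {}`, calling `assign(to_vector(text), clusters)` once per document grows the cluster set incrementally. Each call returns the identifier of the cluster that received the document.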

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.