Parallel document clustering process
US5864855A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Feb 26, 1996 |
| Grant date | Jan 26, 1999 |
| Priority date | — |
| Expiry date | Feb 26, 2016 |
Classification
- Technology area (CPC Y)Emerging Cross-Sectional Technologies
- CPC primaryY10S707/99936
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A computer information processing system utilizes parallel processors for organizing and clustering a large number of documents into a large number of clusters for information analysis and retrieval. After the documents are translated into electronic digital documents, each document is converted into a vector based on weighted list of the occurence of different words and terms that appear in the document. The document vectors are grouped together into cluster vectors on different parallel processors according to similarities. New document vectors are simultaneously compared with existing cluster vectors in the different parallel processors.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.