Method and system for text mining using multidimensional subspaces
US6611825B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 9, 1999 |
| Grant date | Aug 26, 2003 |
| Priority date | — |
| Expiry date | Jun 9, 2019 |
Classification
- Technology area (CPC Y)Emerging Cross-Sectional Technologies
- CPC primaryY10S707/99931
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A text mining program is provided that allows a user to perform text mining operations, such as: information retrieval, term and document visualization, term and document clustering, term and document classification, summarization of individual documents and groups of documents, and document cross-referencing. This is accomplished by representing the text of a document collection using subspace transformations. This subspace transformation representation is performed by: constructing a term frequency matrix of the term frequencies for each of the documents, transforming the term frequencies for statistical purposes, and projecting the documents or the terms into a lower dimensional subspace. As the document collection is updated, the subspace is dynamically updated to reflect the new document collection.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.