Patent · US Expired

Methods, apparatus and computer program products for information retrieval and document classification utilizing a multidimensional subspace

US6701305B1 · kind B1 · utility

352Cited by
33References
47Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 20, 2000
Grant dateMar 2, 2004
Priority date
Expiry dateMar 7, 2022

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/353
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods, apparatus and computer program products are provided for retrieving information from a text data collection and for classifying a document into none, one or more of a plurality of predefined classes. In each aspect, a representation of at least a portion of the original matrix is projected into a lower dimensional subspace and those portions of the subspace representation that relate to the term(s) of the query are weighted following the projection into the lower dimensional subspace. In order to retrieve the documents that are most relevant with respect to a query, the documents are then scored with documents having better scores being of generally greater relevance. Alternatively, in order to classify a document, the relationship of the document to the classes of documents is scored with the document then being classified in those classes, if any, that have the best scores.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.