Patent · US Expired

Method of thematic classification of documents, thematic classification module, and search engine incorporating such a module

US7003519B1 · kind B1 · utility

Cited by: 260
References: 1
Claims: 13
Family size: 0

Assignee

Inventors

Key dates

Filing date: Sep 22, 2000
Grant date: Feb 21, 2006
Priority date
Expiry date: Jul 18, 2021

Classification

  • Technology area (CPC Y): Emerging Cross-Sectional Technologies
  • CPC primary: Y10S707/99937
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method of thematically classifying documents, in particular for making up or updating thematic databases (42) for a search engine, includes the steps of selecting documents representative of each theme, identifying within the selected documents, elements that are characteristic of each theme, allocating a coefficient (R) to each identified element, said coefficient being representative of the relevance of said element relative to the corresponding theme, and for each document (50) for classification, identifying said elements characteristic of each theme contained in the document and, for each theme corresponding thereto, using the coefficients allocated to said elements to calculate the value of a characteristic representative of the relevance of the theme for the document (50), in order to decide whether or not the document relates to the theme.
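
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how the coefficient R is computed or how the final decision is made. The sketch assumes that elements are lowercase words, that R is the share of an element's sample-document occurrences that fall within a theme, that a theme's relevance for a document is the sum of R over the elements present, and that a fixed threshold decides membership. The names `tokenize`, `learn_coefficients`, `theme_scores`, and `classify`, and the default threshold value, are hypothetical.

```python
from collections import Counter, defaultdict


def tokenize(text):
    """Split text into lowercase words (assumed stand-in for 'elements')."""
    words = (w.strip(".,;:!?()\"'").lower() for w in text.split())
    return [w for w in words if w]


def learn_coefficients(samples_by_theme):
    """Return {theme: {element: R}} from representative documents per theme.

    Assumption: R = (sample docs of this theme containing the element)
                  / (sample docs of all themes containing the element),
    so R = 1.0 means the element appears only in this theme's samples.
    """
    doc_freq = defaultdict(Counter)  # theme -> element -> number of docs
    for theme, docs in samples_by_theme.items():
        for doc in docs:
            doc_freq[theme].update(set(tokenize(doc)))
    total = Counter()
    for counts in doc_freq.values():
        total.update(counts)
    return {
        theme: {el: n / total[el] for el, n in counts.items()}
        for theme, counts in doc_freq.items()
    }


def theme_scores(document, coefficients):
    """Sum the coefficients of each theme's elements found in the document."""
    present = set(tokenize(document))
    return {
        theme: sum(r for el, r in coeffs.items() if el in present)
        for theme, coeffs in coefficients.items()
    }


def classify(document, coefficients, threshold=1.5):
    """Return the themes whose score reaches the (assumed) threshold."""
    scores = theme_scores(document, coefficients)
    return sorted(t for t, s in scores.items() if s >= threshold)


# Usage with made-up sample documents.
samples = {
    "sports": ["the football match", "the tennis match score"],
    "finance": ["the stock market price", "the bank interest"],
}
coeffs = learn_coefficients(samples)
print(classify("The football match ended.", coeffs))  # ['sports']
```

In this toy data the word "the" occurs in every sample document, so it receives R = 0.5 for both themes and cannot by itself push a document over the threshold, while theme-exclusive words such as "football" receive R = 1.0.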

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.