Patent · US Active

Text categorization using external knowledge

US8108204B2 · kind B2 · utility

Cited by: 42
References: 3
Claims: 8
Family size: 0

Inventors

Key dates

Filing date: Jul 13, 2006
Grant date: Jan 31, 2012
Priority date
Expiry date: May 20, 2028

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N20/10
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method of providing weighted concepts related to a sequence of one or more words, including: providing on a computer an encyclopedia with concepts and a document explaining each concept; forming, for each word in the encyclopedia, a vector that contains the frequency of the word for each concept; arranging the vector according to the frequency of appearance of the word for each concept; selecting the concepts with the highest frequencies for each word from the vector; truncating the rest of the vector; and inducing a feature generator using the truncated vectors; wherein the feature generator is adapted to receive as input one or more words and provide a list of weighted concepts that are most related to the one or more words provided as input.
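
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The toy encyclopedia, the names `build_feature_generator`, `generate` and `top_k`, the whitespace tokenization, the use of raw counts as the "frequency", and the summing of per-word vectors to score a multi-word input are all assumptions made for illustration; the abstract does not specify them.

```python
from collections import Counter, defaultdict

# Hypothetical toy encyclopedia: concept name -> document explaining it.
TOY_ENCYCLOPEDIA = {
    "Cat": "cat cat feline pet",
    "Dog": "dog dog pet canine",
    "Pet": "pet pet pet cat dog",
}


def build_feature_generator(encyclopedia, top_k=2):
    """Induce a feature generator from truncated word-to-concept vectors."""
    # Form, for each word, a vector of its frequency in each concept's document.
    vectors = defaultdict(Counter)
    for concept, document in encyclopedia.items():
        for word in document.lower().split():  # assumption: whitespace tokens
            vectors[word][concept] += 1

    # Arrange each vector by descending frequency (ties broken by concept
    # name), keep the top_k concepts, and truncate the rest.
    truncated = {}
    for word, vector in vectors.items():
        ranked = sorted(vector.items(), key=lambda kv: (-kv[1], kv[0]))
        truncated[word] = dict(ranked[:top_k])

    def generate(text, n=5):
        """Return up to n (concept, weight) pairs for the input words."""
        scores = Counter()
        for word in text.lower().split():
            # Assumption: weights for several words are combined by summing.
            for concept, freq in truncated.get(word, {}).items():
                scores[concept] += freq
        return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:n]

    return generate


generate = build_feature_generator(TOY_ENCYCLOPEDIA, top_k=2)
print(generate("cat pet"))  # [('Pet', 4), ('Cat', 3)]
```

Truncating each vector to its highest-frequency concepts keeps the lookup table small, at the cost of dropping weakly related concepts; in this toy example the word "pet" no longer maps to "Dog" once `top_k=2` is applied.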

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.