Text categorization using external knowledge
US8108204B2 · kind B2 · utility
Inventors
Key dates
| Filing date | Jul 13, 2006 |
| Grant date | Jan 31, 2012 |
| Priority date | — |
| Expiry date | May 20, 2028 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N20/10
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method of providing weighted concepts related to a sequence of one or more words, including: providing on a computer an encyclopedia with concepts and a document explaining each concept, forming a vector, which contains the frequency of the word for each concept, for each word in the encyclopedia, arranging the vector according to the frequency of appearance of the word for each concept, selecting the concepts with the highest frequencies for each word from the vector, truncating the rest of the vector, inducing a feature generator using the truncated vectors; wherein the feature generator is adapted to receive as input one or more words and provide a list of weighted concepts, which are most related to the one or more words provided as input.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.