Patent · US Active

Categorization automation based on category ontology

US8489523B2 · kind B2 · utility

3Cited by
0References
17Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 31, 2011
Grant dateJul 16, 2013
Priority date
Expiry dateJan 11, 2032

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/955
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method for categorization using multiple categories including obtaining multiple uniform resource locators (URLs) associated with the multiple categories, collecting multiple web pages identified by the multiple URLs, generating vocabulary terms based on the multiple web pages, generating an N-gram file including the multiple vocabulary terms, generating multiple classified URLs by labeling the plurality of URLs based on the multiple categories, generating multiple feature vectors by processing the classified URLs and the multiple web pages against the N-gram file, generating a categorization model by applying a machine learning algorithm to the multiple feature vectors, and loading a classifier with the categorization module and the N-gram file.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.