Categorization automation based on category ontology
US8489523B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 31, 2011 |
| Grant date | Jul 16, 2013 |
| Priority date | — |
| Expiry date | Jan 11, 2032 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/955
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method for categorization using multiple categories including obtaining multiple uniform resource locators (URLs) associated with the multiple categories, collecting multiple web pages identified by the multiple URLs, generating vocabulary terms based on the multiple web pages, generating an N-gram file including the multiple vocabulary terms, generating multiple classified URLs by labeling the plurality of URLs based on the multiple categories, generating multiple feature vectors by processing the classified URLs and the multiple web pages against the N-gram file, generating a categorization model by applying a machine learning algorithm to the multiple feature vectors, and loading a classifier with the categorization module and the N-gram file.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.