Method and system for efficient and exhaustive URL categorization
US8935390B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 8, 2010 |
| Grant date | Jan 13, 2015 |
| Priority date | — |
| Expiry date | Jan 1, 2031 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/955
- WIPO fieldDigital communication
- WIPO sectorElectrical engineering
Abstract
The present method and system relate to categorizing URLs (Uniform Resource Locators) of web pages accessed by multiple users over an IP (Internet Protocol) based data network. The method and system collect real time data from IP data traffic occurring on the IP based data network, and extract parameters from the collected real time data, the parameters including an URL of a web page. The URL is processed by a rule based categorization engine, to associate a matching category to the URL of the web page. When no matching category is inferred, the URL is transferred to a semantic based categorization engine. A matching category is associated to the transferred URL by the semantic based categorization engine, based on a semantic analysis of the textual content extracted from the web page associated to the URL.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.