Classification of top-level domain (TLD) websites based on a known website classification
US10148700B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 30, 2016 |
| Grant date | Dec 4, 2018 |
| Priority date | — |
| Expiry date | Dec 7, 2036 |
Classification
- Technology area (CPC H)Electricity
- CPC primaryH04L63/1433
- WIPO fieldDigital communication
- WIPO sectorElectrical engineering
Abstract
Systems and methods for classification of web sites and/or their corresponding URLs based on a known web site classification are provided. According to one embodiment, a website URL is received that is known to be associated with a particular content classification. A list of candidate domain names including a host name of the website URL is generated based on a defined TLD list. For each of the candidate domain names it is determined whether an IP address of the candidate domain name is equal to an IP address of the website URL. When the result is affirmative, the particular content classification is associated with the candidate domain name; otherwise, a cosine similarity measurement process is performed between information associated with the candidate domain name and information associated with the website URL to determine whether to associate the particular content classification with the candidate domain name.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.