Web page clustering method and device
US11023540B2 · kind B2 · utility
Assignees
Inventors
Key dates
| Filing date | Nov 24, 2017 |
| Grant date | Jun 1, 2021 |
| Priority date | — |
| Expiry date | May 11, 2038 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F16/986
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A web page clustering method and device for clustering web pages according to their web page framework. The method includes: acquiring the uniform resource locators (URLs) of a plurality of web pages to be clustered; for each URL, determining the rewriting rules of the URL and classifying the URL according to those rules; determining the web page framework of the web page corresponding to each URL in each URL class, and determining from those frameworks whether each URL in the class may be clustered; and retaining the URL class if each of its URLs may be clustered.
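The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not define how a rewriting rule or a web page framework is computed, so everything concrete here is an assumption. The hypothetical `rewrite_rule` treats the host plus the path with digit runs replaced by `{n}` as the rule, the hypothetical `framework` treats the ordered sequence of opening tags as the framework, and a URL class is retained only when all of its pages share an identical tag sequence. Pages are passed in as a `{url: html}` mapping so the sketch runs without network access.

```python
import re
from collections import defaultdict
from html.parser import HTMLParser
from urllib.parse import urlsplit


def rewrite_rule(url: str) -> str:
    """Assumed rewriting rule: host + path, with digit runs replaced by '{n}'."""
    parts = urlsplit(url)
    return parts.netloc + re.sub(r"\d+", "{n}", parts.path)


class _TagCollector(HTMLParser):
    """Records the name of every opening tag in document order."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        self.tags.append(tag)


def framework(html: str) -> tuple:
    """Assumed framework: the ordered tuple of opening tag names."""
    collector = _TagCollector()
    collector.feed(html)
    collector.close()
    return tuple(collector.tags)


def cluster_pages(pages: dict) -> dict:
    """Group URLs by rewriting rule, then keep only the classes whose
    pages all have the same framework (assumed clustering criterion)."""
    classes = defaultdict(list)
    for url in pages:
        classes[rewrite_rule(url)].append(url)

    retained = {}
    for rule, urls in classes.items():
        frameworks = {framework(pages[u]) for u in urls}
        if len(frameworks) == 1:  # every URL in the class may be clustered
            retained[rule] = urls
    return retained
```

With two article pages that share a layout and two list pages that do not, only the article class would be retained under these assumptions. A practical system would likely use a tolerance-based similarity measure rather than exact equality of tag sequences.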
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.