Patent · US Active

Web page clustering method and device

US11023540B2 · kind B2 · utility

0Cited by
1References
9Claims
0Family size

Assignees

Inventors

Key dates

Filing dateNov 24, 2017
Grant dateJun 1, 2021
Priority date
Expiry dateMay 11, 2038

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/986
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A web page clustering method and device, used for clustering web pages according to a web page framework, the method including: acquiring uniform resource locators (URL) of a plurality of web pages to be clustered; for the URL of each web page to be clustered, determining rewriting rules of the URL and classifying the URL according to the rewriting rules of the URL; determining a web page framework of the web page corresponding to each URL in each URL class, and determining whether each URL may be clustered according to the web page framework of the web page corresponding to each URL; and retaining the URL class if each URL may be clustered.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.