Patent · US Active

Method and system for web document clustering

US8185530B2 · kind B2 · utility

Cited by: 4
References: 2
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Sep 11, 2008
Grant date: May 22, 2012
Priority date
Expiry date: Oct 13, 2029

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F16/9558
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method and system for web document clustering are provided. The method includes: inputting a plurality of web documents; collecting information on the links and directory structure of the input web documents; extracting, according to the collected links and directory structure, a hierarchical structure for the plurality of web documents; and generating and outputting, based on the extracted hierarchical structure, one or more clusters of the plurality of web documents.
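
The abstract names four steps (input, collect links and directory structure, extract a hierarchy, output clusters) but does not say how the hierarchy is derived. The Python sketch below is only an illustration of that pipeline under a simplifying assumption, not the patented method: the "hierarchy" is reduced to each document's top-level directory, and two documents share a cluster when a hyperlink connects them within the same top-level directory (connected components via union-find). The names section_of and cluster_documents and the example.com URLs are hypothetical.

```python
from collections import defaultdict
from urllib.parse import urlparse


def section_of(url):
    """Return (host, top-level directory); root-level pages get an empty directory."""
    parts = urlparse(url)
    segments = [s for s in parts.path.split("/") if s]
    # A single segment without a trailing slash is a file at the site root.
    if len(segments) <= 1 and not parts.path.endswith("/"):
        return parts.netloc, ""
    return parts.netloc, segments[0] if segments else ""


def cluster_documents(urls, links):
    """Cluster URLs: linked documents in the same top-level directory are merged.

    Assumption for this sketch: links is an iterable of (source_url, target_url).
    """
    parent = {u: u for u in urls}

    def find(u):
        # Follow parent pointers to the set representative, halving the path.
        while parent[u] != u:
            parent[u] = parent[parent[u]]
            u = parent[u]
        return u

    for src, dst in links:
        # Skip links to unknown documents and links that cross sections.
        if src in parent and dst in parent and section_of(src) == section_of(dst):
            parent[find(src)] = find(dst)

    clusters = defaultdict(list)
    for u in urls:
        clusters[find(u)].append(u)
    # Sort for a deterministic result.
    return sorted(sorted(members) for members in clusters.values())


if __name__ == "__main__":
    base = "http://example.com"
    urls = [
        base + "/index.html",
        base + "/docs/",
        base + "/docs/install.html",
        base + "/docs/api/ref.html",
        base + "/blog/2008/post.html",
    ]
    links = [
        (base + "/index.html", base + "/docs/"),
        (base + "/index.html", base + "/blog/2008/post.html"),
        (base + "/docs/", base + "/docs/install.html"),
        (base + "/docs/install.html", base + "/docs/api/ref.html"),
    ]
    for cluster in cluster_documents(urls, links):
        print(cluster)
```

In this example the three pages under /docs/ form one cluster, while the root index page and the blog post each remain alone because their only links cross top-level directories. A fuller treatment would keep the whole directory tree rather than just its first level.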

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.