Patent · US Active

Microhubs and its applications

US8041705B2 · kind B2 · utility

1Cited by
2References
18Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJan 5, 2009
Grant dateOct 18, 2011
Priority date
Expiry dateJan 29, 2030

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99937
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system and method of crawling at least one website comprising at least one URL includes maintaining a lookup structure comprising all of the URLs known to be on a website; calculating a hub score for each webpage of the website to be recrawled, wherein the hub score measures how likely the to be recrawled webpage includes links to fresh content published on the website; sorting all the to be recrawled pages by their hub scores; and crawling the to be recrawled pages in order from highest hub scores to lowest hub scores. The calculating comprises computing a first value equaling a percentage of a number of new relative URLs on the to be recrawled page; computing a second value equaling a percentage of a previous hub score of the to be recrawled page; and computing the hub score as a sum of the first and the second values.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.