Patent · US Active

Sitemap generating client for web crawler

US8037055B2 · kind B2 · utility

9Cited by
23References
30Claims
0Family size

Assignee

Inventors

Key dates

Filing dateAug 23, 2010
Grant dateOct 11, 2011
Priority date
Expiry dateAug 23, 2030

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/951
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods and systems for a sitemap generating client for web crawlers are described. The client accesses one or more sources of document information about the documents available on a website, such as the file system, access logs, or pre-made URL lists. Document information is extracted from the sources and one or more sitemaps are generated based on the extracted document information. A notification is transmitted to a remote computer, informing that the sitemap(s) are available for access and likely have been updated. If the remote computer is associated with a web crawler, the remote computer may access the sitemap(s) and use the sitemaps to schedule a crawl of documents included or available on the website.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.