Patent · US Expired

System and method for enhancing crawling by extracting requests for webpages in an information flow

US7093012B2 · kind B2 · utility

Cited by: 146
References: 7
Claims: 26
Family size: 0

Assignee

Inventors

Key dates

Filing date: Sep 13, 2001
Grant date: Aug 15, 2006
Priority date
Expiry date: Oct 27, 2023

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F2216/09
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method for providing searching and alerting capabilities in traffic content at access points in data networks is disclosed. Typical access points for Internet, intranet and wireless traffic are described. Traffic flow through an Internet Service Provider is used as a preferred embodiment to exemplify the data traffic used as the input source in the invention. The invention teaches how proper privacy and content filters can be applied to the traffic source. The filtered data stream from the traffic flow can be used to improve the quality of existing searching and alerting services. The invention also teaches how a cache can be developed optimized for holding fresh searchable information captured in the traffic flow. It is further disclosed how the said cache can be converted to a searchable index and either separately or in cooperation with external search indexes be used as a basis for improved search services. The invention also discloses how the traffic flow can be analyzed in order to derive added information for measuring document relevance, access similarity between documents, personalized ranking of search results, and regional differences in document accesses.
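
The flow described in the abstract (extract page requests from traffic, apply a privacy filter, keep the freshest requests in a cache) can be illustrated with a minimal sketch. This is not the patented implementation. The input format (one "METHOD URL" pair per line), the names `is_private`, `FreshUrlCache`, and `extract_requests`, and the filter rules in `SENSITIVE_KEYS` are all assumptions made for illustration.

```python
from collections import OrderedDict
from urllib.parse import urlsplit

# Hypothetical query-string markers treated as private; not taken from the patent.
SENSITIVE_KEYS = ("password", "token", "session")


def is_private(url):
    """Assumed privacy filter: drop non-HTTP URLs, URLs with credentials,
    and URLs whose query string contains a sensitive marker."""
    parts = urlsplit(url)
    if parts.scheme != "http":
        return True
    if parts.username or parts.password:
        return True
    query = parts.query.lower()
    return any(key in query for key in SENSITIVE_KEYS)


class FreshUrlCache:
    """Bounded cache that keeps the most recently requested URLs
    together with how often each was seen while cached."""

    def __init__(self, capacity):
        self.capacity = capacity
        self._entries = OrderedDict()  # url -> request count

    def observe(self, url):
        # Re-inserting moves the URL to the "freshest" end.
        count = self._entries.pop(url, 0) + 1
        self._entries[url] = count
        if len(self._entries) > self.capacity:
            self._entries.popitem(last=False)  # evict the stalest URL

    def urls(self):
        """Return cached URLs ordered from stalest to freshest."""
        return list(self._entries)

    def count(self, url):
        return self._entries.get(url, 0)


def extract_requests(request_lines, cache):
    """Feed GET requests that pass the privacy filter into the cache."""
    for line in request_lines:
        parts = line.split()
        if len(parts) < 2 or parts[0] != "GET":
            continue
        url = parts[1]
        if is_private(url):
            continue
        cache.observe(url)
```

The per-URL counts kept by the cache are a simple stand-in for the access statistics that the abstract says can be derived from the traffic flow; a crawler or indexer could read `cache.urls()` to decide which pages to fetch first.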

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.