System and method for enhancing crawling by extracting requests for webpages in an information flow
US7093012B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 13, 2001 |
| Grant date | Aug 15, 2006 |
| Priority date | — |
| Expiry date | Oct 27, 2023 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F2216/09
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method for providing searching and alerting capabilities in traffic content at access points in data networks is disclosed. Typical access points for Internet, intranet and wireless traffic are described. Traffic flow through an Internet Service Provider is used as a preferred embodiment to exemplify the data traffic used as the input source in the invention. The invention teaches how proper privacy and content filters can be applied to the traffic source. The filtered data stream from the traffic flow can be used to improve the quality of existing searching and alerting services. The invention also teaches how a cache can be developed optimized for holding fresh searchable information captured in the traffic flow. It is further disclosed how the said cache can be converted to a searchable index and either separately or in cooperation with external search indexes be used as a basis for improved search services. The invention also discloses how the traffic flow can be analyzed in order to derive added information for measuring document relevance, access similarity between documents, personalized ranking of search results, and regional differences in document accesses.
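
As a rough illustration of the pipeline the abstract outlines (extract page requests from a traffic flow, apply a privacy filter, and keep recently requested pages in a cache), here is a minimal Python sketch. It is not the patented implementation. The proxy-style request-line format, the filter rule (drop URLs with query strings or embedded credentials), and the names `is_shareable`, `FreshCache`, `extract_requests`, and `crawl_candidates` are all assumptions made for illustration.

```python
from collections import OrderedDict
from urllib.parse import urlsplit


def is_shareable(url: str) -> bool:
    """Hypothetical privacy filter: keep plain http URLs that carry
    no query string and no embedded credentials."""
    parts = urlsplit(url)
    return parts.scheme == "http" and not parts.query and "@" not in parts.netloc


class FreshCache:
    """Bounded map of URL -> access count, most recently requested last."""

    def __init__(self, capacity: int) -> None:
        self.capacity = capacity
        self.entries: "OrderedDict[str, int]" = OrderedDict()

    def record(self, url: str) -> None:
        # Re-inserting moves the URL to the "freshest" end of the map.
        count = self.entries.pop(url, 0) + 1
        self.entries[url] = count
        if len(self.entries) > self.capacity:
            self.entries.popitem(last=False)  # evict the stalest URL


def extract_requests(lines):
    """Yield URLs from request lines shaped like
    'GET http://host/path HTTP/1.1' (an assumed input format)."""
    for line in lines:
        fields = line.split()
        if len(fields) == 3 and fields[0] == "GET" and fields[2].startswith("HTTP/"):
            yield fields[1]


def crawl_candidates(lines, capacity=1000):
    """Run extraction and filtering, returning the populated cache."""
    cache = FreshCache(capacity)
    for url in extract_requests(lines):
        if is_shareable(url):
            cache.record(url)
    return cache
```

In this sketch the access counts stand in for the relevance signal the abstract mentions, and the URLs left in the cache would be the ones handed to a crawler or indexer. A real deployment would need far stricter privacy and content filtering than this single rule.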
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.