Patent · US Active

Document reuse in a search engine crawler

US10216847B2 · kind B2 · utility

0Cited by
71References
19Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 8, 2017
Grant dateFeb 26, 2019
Priority date
Expiry dateJun 8, 2037

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/951
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems and method are provided for setting a respective reuse flag for a corresponding document in a plurality of documents based on a query-independent score associated with the corresponding document. A document crawling operation is performed on the plurality of documents in accordance with the reuse flag for respective documents in the plurality of documents. This document crawling operation includes reusing a previously downloaded version of a respective document in the plurality of documents instead of downloading a current version of the respective document from a host computer in accordance with a determination that the reuse flag associated with the respective document meets a predefined criterion.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.