Patent · US Active

Identifying unvisited portions of visited information

US9916337B2 · kind B2 · utility

Cited by: 1
References: 6
Claims: 14
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jul 5, 2016
Grant date: Mar 13, 2018
Priority date
Expiry date: Jul 5, 2036

Classification

  • Technology area (CPC H): Electricity
  • CPC primary: H04L67/02
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Identifying unvisited portions of visited information to visit includes receiving information to crawl, wherein the information is representative of one of web based information and non-web based information, computing a locality sensitive hash (LSH) value for the received information, and identifying a most similar information visited thus far. Identifying unvisited portions of visited information further includes determining whether the LSH of the received information is equivalent to most similar information visited thus far and, responsive to a determination that the LSH of the received information is not equivalent to most similar information visited thus far, identifying a visited portion of the received information using information for most similar information visited thus far and crawling only unvisited portions of the received information.
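
The flow described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the abstract does not name a hash family or say how portions are delimited. The sketch assumes SimHash as the LSH, Hamming distance as the similarity measure, and non-empty lines as the "portions". The names `simhash`, `hamming`, and `Crawler` are hypothetical.

```python
import hashlib


def simhash(text, bits=64):
    """SimHash over whitespace tokens (assumed here as the LSH; bits <= 64)."""
    weights = [0] * bits
    for token in text.split():
        # Stable 64-bit token hash taken from the first 8 bytes of MD5.
        digest = hashlib.md5(token.encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")
        for i in range(bits):
            weights[i] += 1 if (h >> i) & 1 else -1
    # Bit i of the result is set when the tokens voted it on overall.
    return sum(1 << i for i in range(bits) if weights[i] > 0)


def hamming(a, b):
    """Number of differing bits between two hash values."""
    return bin(a ^ b).count("1")


class Crawler:
    """Tracks visited information as (LSH value, set of portions) pairs."""

    def __init__(self):
        self.visited = []

    def crawl(self, document):
        """Return only the portions of `document` that still need crawling."""
        # Assumption: a "portion" is a non-empty line of the document.
        portions = [p for p in document.split("\n") if p.strip()]
        lsh = simhash(document)

        # Identify the most similar information visited thus far.
        nearest = min(
            self.visited,
            key=lambda entry: hamming(entry[0], lsh),
            default=None,
        )

        # Equivalent LSH: treat the information as already visited.
        if nearest is not None and nearest[0] == lsh:
            return []

        # Otherwise, use the nearest entry to mark visited portions
        # and keep only the unvisited ones.
        seen = nearest[1] if nearest is not None else set()
        unvisited = [p for p in portions if p not in seen]
        self.visited.append((lsh, set(portions)))
        return unvisited
```

Under these assumptions, calling `crawl` twice on the same document returns every portion the first time and an empty list the second time, because the two LSH values are identical. A document that shares some lines with an earlier one returns only its new lines, provided its LSH value differs from the earlier one.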

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.