Patent · US Active

Using hash signatures of DOM objects to identify website similarity

US10757101B2 · kind B2 · utility

3Cited by
8References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 25, 2018
Grant dateAug 25, 2020
Priority date
Expiry dateSep 25, 2038

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04L2463/103
  • WIPO fieldDigital communication
  • WIPO sectorElectrical engineering

Abstract

Embodiments are directed to using a hash signature of a rendered DOM object of a website to find similar content and behavior on other websites. Embodiments break a DOM into a large number of data portions (i.e., “shingles”), apply a hashing algorithm to the shingles, select a predetermined number of hashes from the hashed shingles according to a selection criteria to create a hash signature, and compare the hash signature to that of a reference page to determine similarity of website DOM object content. Embodiments can be used to identify phishing websites, defaced websites, spam websites, significant changes in the content of a webpage, copyright infringement, and any other suitable purposes related to the similarity between website DOM object content.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.