Patent · US Active

Clustering repetitive structure of asynchronous web application content

US9734149B2 · kind B2 · utility

3Cited by
0References
9Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 2, 2015
Grant dateAug 15, 2017
Priority date
Expiry dateJul 14, 2035

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/986
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A processor determines whether a DOM includes a repetitive pattern of a combination, formed by a tag of a leaf node and a tag of a parent node of the leaf node. Determining the repetitive pattern of the combination, the processor identifies a first inner cluster is identified by collapsing multiple instances of the repetitive pattern into a single instance. The processor generates a LSH signature for the single instance of the repetitive pattern. The processor determines an outer cluster, based on grouping one or more inner clusters, as part of a section rooted at a source node of the DOM, in which the source node is a parent node of the one or more inner clusters. Determining that a pair of outer clusters are near repetitive, the processor limits web content exploration to one of the pair of outer clusters.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.