Patent · US Active

System and method for automatically identifying classified websites

US8682882B2 · kind B2 · utility

1Cited by
9References
22Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJan 29, 2013
Grant dateMar 25, 2014
Priority date
Expiry dateJan 29, 2033

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/958
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems, methods, and computer readable storage mediums are provided to automatically identifying a classified website. A website is determined to be a candidate site based on a set of heuristics. From among pages constituting the candidate site one or more pages are determined to be listing page candidates and one or more pages are determined to be detail page candidates. Then a listing page score is determined using a listing page classifier. Similarly, a detail page score is determined using a detail page classifier. The listing page and detail page scores each indicate the likelihood that the pages are part of a classified website. A candidate site score is determined based in part on a combination of the listing page score and the detail page scores. Then when the candidate site score is above a threshold the candidate site is determined to be a classified website.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.