Patent · US Active

System and method for automatically identifying classified websites

US8380693B1 · kind B1 · utility

8Cited by
9References
22Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 8, 2011
Grant dateFeb 19, 2013
Priority date
Expiry dateSep 8, 2031

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/958
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems, methods, and computer readable storage mediums are provided to automatically identifying a classified website. A website is determined to be a candidate site based on a set of heuristics. From among pages constituting the candidate site one or more pages are determined to be listing page candidates and one or more pages are determined to be detail page candidates. Then a listing page score is determined using a listing page classifier. Similarly, a detail page score is determined using a detail page classifier. The listing page and detail page scores each indicate the likelihood that the pages are part of a classified website. A candidate site score is determined based in part on a combination of the listing page score and the detail page scores. Then when the candidate site score is above a threshold the candidate site is determined to be a classified website.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.