Patent · US Active

Detecting indicators of misleading content in markup language coded documents using the formatting of the document

US7895515B1 · kind B1 · utility

15Cited by
6References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateFeb 28, 2007
Grant dateFeb 22, 2011
Priority date
Expiry dateFeb 25, 2029

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/9024
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method for detecting indicators of misleading content in a markup language coded document is provided. The method includes extracting a set of tags from the markup language coded document. The method also includes combining tags to create a tag structure signature. The tag structure signature is configured to include a set of n-grams. Each of the set of n-grams includes at least two tags from the set of tags. The method further includes comparing the tag structure signature against a set of known bad tag structure signatures to determine similarity.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.