Detecting indicators of misleading content in markup language coded documents using the formatting of the document
US7895515B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 28, 2007 |
| Grant date | Feb 22, 2011 |
| Priority date | — |
| Expiry date | Feb 25, 2029 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F16/9024
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method for detecting indicators of misleading content in a markup language coded document is provided. The method includes extracting a set of tags from the markup language coded document. The method also includes combining tags to create a tag structure signature. The tag structure signature is configured to include a set of n-grams. Each of the set of n-grams includes at least two tags from the set of tags. The method further includes comparing the tag structure signature against a set of known bad tag structure signatures to determine similarity.
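The steps named in the abstract (extract tags, combine them into n-grams, compare against known bad signatures) can be illustrated with a minimal Python sketch. This is not the patented implementation: the function names, the choice of n = 2, the use of opening tags only, and Jaccard overlap as the similarity measure are all assumptions made for illustration, since the abstract does not specify them.

```python
from html.parser import HTMLParser


class TagCollector(HTMLParser):
    """Collects opening tag names in document order (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names; attributes and text are ignored.
        self.tags.append(tag)


def extract_tags(markup):
    """Return the ordered list of opening tags found in the markup."""
    collector = TagCollector()
    collector.feed(markup)
    collector.close()
    return collector.tags


def tag_signature(tags, n=2):
    """Build a signature as the set of n-grams of consecutive tags."""
    if n < 2:
        raise ValueError("each n-gram must contain at least two tags")
    return {tuple(tags[i:i + n]) for i in range(len(tags) - n + 1)}


def jaccard(a, b):
    """Assumed similarity measure: shared n-grams over all n-grams."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def best_match(signature, known_bad):
    """Highest similarity between a signature and any known bad signature."""
    return max((jaccard(signature, bad) for bad in known_bad), default=0.0)


doc = "<html><body><form><input><input></form></body></html>"
signature = tag_signature(extract_tags(doc))
known_bad = [tag_signature(["body", "form", "input", "input"])]
score = best_match(signature, known_bad)
```

In this sketch a document would be flagged when `score` exceeds some threshold. The threshold value is left open because the abstract says only that similarity is determined, not how it is scored or what cutoff applies.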
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.