Patent · US Expired

Method, system, and computer-readable medium for filtering harmful HTML in an electronic document

US7308648B1 · kind B1 · utility

Cited by: 71
References: 7
Claims: 25
Family size: 0

Assignee

Inventors

Key dates

Filing date: Nov 27, 2002
Grant date: Dec 11, 2007
Priority date
Expiry date: Feb 6, 2024

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F16/986
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method and system are provided for filtering harmful HTML content from an electronic document. An application program interface (API) examines the fundamental structure of the HTML content in the document. The HTML content in the electronic document is parsed into HTML elements and attributes by a tokenizer and compared to a content library by a filter in the API. The filter removes unknown HTML content as well as known content that is listed as harmful in the content library. After the harmful HTML content has been removed, a new document is encoded that includes the remaining safe HTML content for viewing in a web browser.
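
The pipeline the abstract describes (tokenize, compare against a content library, drop unknown or harmful items, re-encode) can be illustrated with a minimal Python sketch. This is not the patented implementation. The names SAFE_ELEMENTS, SAFE_ATTRIBUTES, HARMFUL_ELEMENTS, AllowlistFilter, and filter_html are hypothetical, the content library shown is a toy one, and the choices to keep the text inside unknown elements and to accept only http(s) links are assumptions made for illustration. Python's standard html.parser stands in for the tokenizer.

```python
from html import escape
from html.parser import HTMLParser

# Hypothetical "content library" (illustrative only, not from the patent).
SAFE_ELEMENTS = {"p", "b", "i", "a", "br"}   # known-safe elements
SAFE_ATTRIBUTES = {"a": {"href"}}            # known-safe attributes per element
VOID_ELEMENTS = {"br"}                       # elements emitted without an end tag
HARMFUL_ELEMENTS = {"script", "style"}       # known-harmful: drop tag and contents


class AllowlistFilter(HTMLParser):
    """Tokenizes HTML and re-emits only elements/attributes found in the library."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []          # pieces of the re-encoded document
        self.skip_depth = 0    # > 0 while inside a harmful element

    def handle_starttag(self, tag, attrs):
        if tag in HARMFUL_ELEMENTS:
            self.skip_depth += 1
            return
        # Unknown elements are dropped; this sketch keeps their text (assumption).
        if self.skip_depth or tag not in SAFE_ELEMENTS:
            return
        allowed = SAFE_ATTRIBUTES.get(tag, set())
        kept = []
        for name, value in attrs:
            if name not in allowed:
                continue  # unknown attribute, e.g. onclick
            value = value or ""
            # Assumption: only absolute http(s) links are treated as safe.
            if name == "href" and not value.lower().startswith(("http://", "https://")):
                continue
            kept.append(f' {name}="{escape(value, quote=True)}"')
        self.out.append(f"<{tag}{''.join(kept)}>")

    def handle_endtag(self, tag):
        if tag in HARMFUL_ELEMENTS:
            if self.skip_depth:
                self.skip_depth -= 1
            return
        if self.skip_depth or tag not in SAFE_ELEMENTS or tag in VOID_ELEMENTS:
            return
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.skip_depth:
            # Re-escape text so it cannot be reinterpreted as markup.
            self.out.append(escape(data, quote=False))


def filter_html(source: str) -> str:
    """Return a new document containing only the content judged safe."""
    parser = AllowlistFilter()
    parser.feed(source)
    parser.close()  # flush any buffered text
    return "".join(parser.out)


print(filter_html('<p onclick="x()">Hi <b>there</b><script>alert(1)</script></p>'))
```

The sketch treats anything absent from the library as unsafe, which mirrors the abstract's statement that unknown content is removed along with content listed as harmful. A production filter would need a far larger library and careful handling of malformed markup.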

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.