Method, system, and computer-readable medium for filtering harmful HTML in an electronic document
US7308648B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 27, 2002 |
| Grant date | Dec 11, 2007 |
| Priority date | — |
| Expiry date | Feb 6, 2024 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/986
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method and system are provided for filtering harmful HTML content from an electronic document. An application program interface (API) examines the fundamental structure of the HTML content in the document. The HTML content in the electronic document is parsed into HTML elements and attributes by a tokenizer and compared to a content library by a filter in the API. The filter removes unknown HTML content as well as known content that is listed as harmful in the content library. After the harmful HTML content has removed, a new document is encoded which includes the remaining safe HTML content for viewing in a web browser.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.