Sensitivity categorization of web pages
US8589231B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 28, 2010 |
| Grant date | Nov 19, 2013 |
| Priority date | — |
| Expiry date | Nov 1, 2031 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06Q30/0277
- WIPO field: IT methods for management
- WIPO sector: Electrical engineering
Abstract
Methods, systems, and computer programs for categorizing the sensitivity of web pages are presented. In one method, a space of sensitive pages is identified based on the sensitivity categorization of a first plurality of web pages and a second plurality of web pages. The first plurality of web pages is obtained by performing search queries using known sensitive words, and the second plurality of web pages includes randomly selected web pages. Additionally, the method identifies a third plurality of web pages that includes web pages on or near the boundary between the space of sensitive pages and the space of non-sensitive pages. The space of sensitive pages is then redefined based on the sensitivity categorization of the first, second, and third pluralities of web pages. Once the space of sensitive pages is defined, the method is used to determine that a given web page is sensitive when the given web page is in the space of sensitive pages. Web pages are included in a marketing operation when the web pages are not sensitive.
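The refinement loop in the abstract can be illustrated with a minimal sketch. The abstract does not say which classifier or features are used, so everything here is an assumption made for illustration: pages are plain strings, the "space of sensitive pages" is approximated by a smoothed per-word log-odds score, and `label_fn` is a hypothetical stand-in for whatever process assigns the sensitivity categorization (for example, human raters).

```python
from collections import Counter
import math


def tokenize(text):
    """Lowercase whitespace tokenizer (illustrative assumption)."""
    return text.lower().split()


def train(labeled):
    """Fit per-word log-odds weights from (text, is_sensitive) pairs.

    Assumption: a Laplace-smoothed word-presence model stands in for the
    unspecified classifier that defines the space of sensitive pages.
    """
    pos, neg = Counter(), Counter()
    for text, sensitive in labeled:
        (pos if sensitive else neg).update(set(tokenize(text)))
    n_pos = sum(1 for _, sensitive in labeled if sensitive)
    n_neg = len(labeled) - n_pos
    return {
        w: math.log((pos[w] + 1) / (n_pos + 2))
        - math.log((neg[w] + 1) / (n_neg + 2))
        for w in set(pos) | set(neg)
    }


def score(weights, text):
    """Signed distance-like score: positive means inside the sensitive space."""
    return sum(weights.get(w, 0.0) for w in set(tokenize(text)))


def is_sensitive(weights, text):
    return score(weights, text) > 0


def boundary_pages(weights, unlabeled, k):
    """Pick the k pages whose score is closest to the decision boundary (0)."""
    return sorted(unlabeled, key=lambda t: abs(score(weights, t)))[:k]


def refine(seed_pages, random_pages, unlabeled, label_fn, k=2):
    """Define the space from the first two page sets, then redefine it
    after categorizing a third set taken from on or near the boundary."""
    labeled = [(p, label_fn(p)) for p in seed_pages + random_pages]
    weights = train(labeled)
    third = boundary_pages(weights, unlabeled, k)
    labeled += [(p, label_fn(p)) for p in third]
    return train(labeled)


def eligible_for_marketing(weights, pages):
    """Keep only pages that fall outside the sensitive space."""
    return [p for p in pages if not is_sensitive(weights, p)]


# Toy data; label_fn is a hypothetical oracle, not part of the patent text.
SENSITIVE_TERMS = {"violent", "attack"}
label_fn = lambda page: any(w in SENSITIVE_TERMS for w in tokenize(page))

seed_pages = ["violent crash report", "violent attack news"]   # from sensitive-word queries
random_pages = ["garden tips today", "cooking recipe news"]    # randomly selected
unlabeled = ["attack garden news", "cooking tips recipe", "crash today"]

weights = refine(seed_pages, random_pages, unlabeled, label_fn)
print(eligible_for_marketing(weights, ["violent storm", "cooking tips"]))
```

In this toy run the two pages nearest the boundary are the ones mixing words seen in both classes, so categorizing them changes the model more than categorizing a page that is already far from the boundary would.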
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.