Method and apparatus for finding mirrored hosts by analyzing connectivity and IP addresses
US6487555B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | May 7, 1999 |
| Grant date | Nov 26, 2002 |
| Priority date | — |
| Expiry date | May 7, 2019 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/951
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method and system that detects mirrored host pairs using information about a large set of pages, including one or more of: URLs, IP addresses, and connectivity information. The identities of the detected mirrored hosts are then saved so that browsers, crawlers, proxy servers, or the like can correctly identify mirrored web sites. The described embodiments of the present invention use one or a combination of techniques to identify mirrors. A first group of techniques involves determining mirrors based on URLs and information about connectivity (i.e., hyperlinks) between pages. A second group of techniques looks at connectivity information at a higher granularity, considering all links from all pages on a host as one group and ignoring the target of each link beyond the host level.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.