Methods and apparatus for computing graph similarity via sequence similarity
US7996349B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 5, 2007 |
| Grant date | Aug 9, 2011 |
| Priority date | — |
| Expiry date | Jan 6, 2030 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F16/9558
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
This disclosure describes systems and methods for identifying and correcting anomalies in web graphs. A web graph is transformed into a sequence of tokens via a walk algorithm. The sequence is fingerprinted to form a set of shingles. The shingles are compared to shingles for other web graphs in order to determine similarity between web graphs. Actions are then carried out to remove anomalous web graphs and modify parameters governing web mapping in order to decrease the likelihood of future anomalous web graphs being built.
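
The walk, fingerprint, and compare steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not specify the walk algorithm, the shingle width, the fingerprint function, or the similarity measure, so the breadth-first walk, the window size `k=3`, the truncated SHA-1 fingerprint, and the Jaccard coefficient below are all assumptions chosen for illustration. The function names (`walk_tokens`, `shingles`, `jaccard`, `graph_similarity`) are hypothetical.

```python
import hashlib
from collections import deque


def walk_tokens(graph, start):
    """Turn a graph into a token sequence with a breadth-first walk.

    Assumption: nodes are strings and neighbors are visited in sorted
    order so the sequence is deterministic.
    """
    seen = {start}
    queue = deque([start])
    tokens = []
    while queue:
        node = queue.popleft()
        tokens.append(node)
        for nxt in sorted(graph.get(node, ())):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return tokens


def shingles(tokens, k=3):
    """Fingerprint each window of k consecutive tokens.

    Assumption: a truncated SHA-1 digest stands in for the fingerprint.
    A sequence shorter than k yields a single shingle over all tokens.
    """
    out = set()
    for i in range(max(len(tokens) - k + 1, 1)):
        window = "\x1f".join(tokens[i:i + k])
        out.add(hashlib.sha1(window.encode("utf-8")).hexdigest()[:16])
    return out


def jaccard(a, b):
    """Jaccard coefficient of two shingle sets (assumed similarity measure)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def graph_similarity(g1, g2, start, k=3):
    """Similarity of two graphs walked from the same start node."""
    return jaccard(
        shingles(walk_tokens(g1, start), k),
        shingles(walk_tokens(g2, start), k),
    )
```

In this sketch a graph is a dict mapping each node to a list of its out-neighbors. Two snapshots of the same site would be walked from the same start node, and a score far below those of the other snapshots could flag a graph as a candidate anomaly; the threshold for that decision is left open here because the abstract does not give one.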
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.