Patent · US Expired

Method for estimating the probability of collisions of fingerprints

US5974481A · kind A · utility

Cited by: 17
References: 2
Claims: 8
Family size: 0

Assignee

Inventor

Key dates

Filing date: Sep 15, 1997
Grant date: Oct 26, 1999
Priority date
Expiry date: Sep 15, 2017

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F16/90344
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Strings, such as Web pages or other documents, are fingerprinted in order to detect substantially similar strings, so as to avoid processing duplicate strings. At the same time, a computerized method estimates the probability of a collision among fingerprints of dissimilar strings. As fingerprints are generated for strings presented for processing, when the fingerprint of a string is determined not to be identical to any fingerprint in a set of stored fingerprints, the new fingerprint is masked and the unmasked portion of the fingerprint is compared with the corresponding portion of each fingerprint in the stored set. Information is recorded regarding the number of matching masked fingerprints.
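
The bookkeeping described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the patented implementation: the abstract does not specify the fingerprint function, the mask width, or the data structures, so the use of a truncated SHA-256 digest, the `unmasked_bits` parameter, and the names `fingerprint` and `CollisionEstimator` are all assumptions made for this example.

```python
import hashlib


def fingerprint(text: str, bits: int = 64) -> int:
    """Hypothetical fingerprint: the leading `bits` bits of a SHA-256 digest."""
    digest = hashlib.sha256(text.encode("utf-8")).digest()
    return int.from_bytes(digest[: bits // 8], "big")


class CollisionEstimator:
    """Records how often new fingerprints partially match stored ones."""

    def __init__(self, unmasked_bits: int = 16):
        # Assumption: the mask hides the high bits and leaves the low
        # `unmasked_bits` bits visible for comparison.
        self.keep = (1 << unmasked_bits) - 1
        self.stored = set()        # full fingerprints seen so far
        self.partial_counts = {}   # unmasked portion -> number of stored fingerprints
        self.comparisons = 0       # masked comparisons made against stored fingerprints
        self.partial_matches = 0   # how many of those comparisons matched

    def offer(self, text: str) -> bool:
        """Return True if the string is new, False if it is a duplicate."""
        fp = fingerprint(text)
        if fp in self.stored:
            return False  # identical fingerprint: treat as a duplicate string
        part = fp & self.keep
        # Compare the unmasked portion against every stored fingerprint
        # (done here via a count table rather than a linear scan).
        self.partial_matches += self.partial_counts.get(part, 0)
        self.comparisons += len(self.stored)
        self.stored.add(fp)
        self.partial_counts[part] = self.partial_counts.get(part, 0) + 1
        return True
```

Calling `offer` once per incoming string accumulates `partial_matches` and `comparisons`. How those recorded counts are turned into a probability estimate for full-length fingerprints is not stated in the abstract, so the sketch stops at the recording step.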

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.