Patent · US Active

Methods and apparatus for content fingerprinting for information leakage prevention

US8032757B1 · kind B1 · utility

5Cited by
3References
11Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 16, 2008
Grant dateOct 4, 2011
Priority date
Expiry dateMay 9, 2030

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F21/6272
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Processes for fingerprinting a document and for preventing information leakage at a deployment point are disclosed. For fingerprinting a document, a sequence of hash values for a document is generated, a portion of said hash values to be selected as fingerprints for the document. A current window is positioned over a portion of the sequence of hash values. The hash values are examined starting from one end of the current window, and a first-encountered hash value that is 0 modulo P is selected to be a fingerprint for the current window. For information leakage prevention at a deployment point, a rolling hash calculation is performed on a target document, and a determination is made if a hash value is 0 modulo P. A first filter is applied if the hash value is 0 modulo P, and a second filter is otherwise applied. Other embodiments, aspects and features are also disclosed.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.