Methods and apparatus for content fingerprinting for information leakage prevention
US8032757B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | May 16, 2008 |
| Grant date | Oct 4, 2011 |
| Priority date | — |
| Expiry date | May 9, 2030 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F21/6272
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Processes for fingerprinting a document and for preventing information leakage at a deployment point are disclosed. For fingerprinting a document, a sequence of hash values for a document is generated, a portion of said hash values to be selected as fingerprints for the document. A current window is positioned over a portion of the sequence of hash values. The hash values are examined starting from one end of the current window, and a first-encountered hash value that is 0 modulo P is selected to be a fingerprint for the current window. For information leakage prevention at a deployment point, a rolling hash calculation is performed on a target document, and a determination is made if a hash value is 0 modulo P. A first filter is applied if the hash value is 0 modulo P, and a second filter is otherwise applied. Other embodiments, aspects and features are also disclosed.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.