Patent · US Active

Automatic generation of embedded signatures for duplicate detection on a public network

US7979413B2 · kind B2 · utility

14Cited by
13References
20Claims
0Family size

Assignees

Inventors

Key dates

Filing dateMay 30, 2008
Grant dateJul 12, 2011
Priority date
Expiry dateAug 28, 2029

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/3334
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

In accordance with an aspect of the invention, a method and system are disclosed for constructing an embedded signature in order to facilitate post-facto detection of leakage of sensitive data. The leakage detection mechanism involves: 1) identifying at least one set of words in an electronic document containing sensitive data, the set of words having a low frequency of occurrence in a first collection of electronic documents; and, 2) transmitting a query to search a second collection of electronic documents for any electronic document that contains the set of words having a low frequency of occurrence. This leakage detection mechanism has at least the following advantages: a) it is tamper-resistant; b) it avoids the need to add a watermark to the sensitive data, c) it can be used to locate the sensitive data even if the leakage occurred before the embedded signature was ever identified; and, d) it can be used to detect an embedded signature regardless of whether the data is being presented statically or dynamically.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.