Patent · US Active

Methods for document-to-template matching for data-leak prevention

US8254698B2 · kind B2 · utility

Cited by: 9
References: 5
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Apr 2, 2009
Grant date: Aug 28, 2012
Priority date
Expiry date: Jun 29, 2031

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F18/22
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

The present invention discloses methods for document-to-template matching for data-leak prevention (DLP), the methods including the steps of: providing a document as a stream of characters; splitting the stream into a plurality of serialized data lines; calculating a hash value for each serialized data line; checking for each hash value in a hash map of a template set; determining a similarity match to a particular template based on a predefined threshold of template hash values, of the template set, being found in the stream; and based on the similarity match, executing a DLP security policy for the document. Preferably, the template set is extracted from documents manually prepared by a security administrator. Preferably, each template in the template set is deduced automatically from a plurality of documents.
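The steps listed in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. The whitespace normalization, the choice of SHA-256, the 0.7 default threshold, the "block"/"allow" policy outcome, and every function name below are assumptions made for illustration only.

```python
import hashlib
from collections import defaultdict


def line_hashes(text):
    """Split a character stream into lines and hash each non-empty line.

    Assumption: lines are whitespace-normalized before hashing, and
    SHA-256 is used; the abstract does not name a hash function.
    """
    hashes = []
    for line in text.splitlines():
        normalized = " ".join(line.split())
        if normalized:
            digest = hashlib.sha256(normalized.encode("utf-8")).hexdigest()
            hashes.append(digest)
    return hashes


def build_hash_map(templates):
    """Build a hash map from line hash to the template names containing it.

    Also returns the number of distinct line hashes per template, which is
    needed to compute the fraction of a template found in a document.
    """
    hash_map = defaultdict(set)
    sizes = {}
    for name, text in templates.items():
        unique = set(line_hashes(text))
        sizes[name] = len(unique)
        for h in unique:
            hash_map[h].add(name)
    return hash_map, sizes


def match_template(document, hash_map, sizes, threshold=0.7):
    """Return (template_name, score) for the best template whose fraction of
    line hashes found in the document meets the threshold, else (None, 0.0).
    """
    found = defaultdict(set)
    for h in line_hashes(document):
        for name in hash_map.get(h, ()):
            found[name].add(h)

    best_name, best_score = None, 0.0
    for name, hits in found.items():
        score = len(hits) / sizes[name]
        if score >= threshold and score > best_score:
            best_name, best_score = name, score
    return best_name, best_score


def apply_policy(document, hash_map, sizes, threshold=0.7):
    """Hypothetical DLP policy: block documents that match any template."""
    name, _ = match_template(document, hash_map, sizes, threshold)
    return "block" if name is not None else "allow"


# Usage: a form template and a filled-in document that reuses 3 of its 4 lines.
templates = {
    "invoice": "Invoice Number:\nBill To:\nAmount Due:\nPayment Terms:\n",
}
hash_map, sizes = build_hash_map(templates)
filled = "Invoice Number:\n12345\nBill To:\nACME Corp\nAmount Due:\n$100\n"
result = match_template(filled, hash_map, sizes)
decision = apply_policy(filled, hash_map, sizes)
```

In this sketch the score is the fraction of a template's distinct line hashes that appear in the document, so a filled-in form still matches its template even though the entered values add lines the template does not contain. Scoring against the template rather than the document is one reading of "a predefined threshold of template hash values ... being found in the stream"; other scoring rules are possible.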

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.