Patent · US Active

Systems and methods for generating machine learning-based classifiers for detecting specific categories of sensitive information

US8688601B2 · kind B2 · utility

308 Cited by
1 References
20 Claims
0 Family size

Assignee

Inventor

Key dates

Filing date: Jul 26, 2011
Grant date: Apr 1, 2014
Priority date
Expiry date: Mar 7, 2032

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N20/00
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A computer-implemented method may include (1) identifying a plurality of specific categories of sensitive information to be protected by a DLP system, (2) obtaining a training data set for each specific category of sensitive information that includes a plurality of positive and a plurality of negative examples of the specific category of sensitive information, (3) using machine learning to train, based on an analysis of the training data sets, at least one machine learning-based classifier that is capable of detecting items of data that contain one or more of the plurality of specific categories of sensitive information, and then (4) deploying the machine learning-based classifier within the DLP system to enable the DLP system to detect and protect items of data that contain one or more of the plurality of specific categories of sensitive information in accordance with at least one DLP policy of the DLP system.
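
The four steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not name a learning algorithm, so a small naive Bayes text classifier is used here as a stand-in, and the category names, training sentences, policy actions, and the helpers `tokenize`, `CategoryClassifier`, `train`, and `scan` are all hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


class CategoryClassifier:
    """Binary naive Bayes classifier for one category (illustrative stand-in)."""

    def __init__(self, positives, negatives):
        # Step 3: learn token counts from positive and negative examples.
        self.counts = {True: Counter(), False: Counter()}
        for label, docs in ((True, positives), (False, negatives)):
            for doc in docs:
                self.counts[label].update(tokenize(doc))
        self.vocab = set(self.counts[True]) | set(self.counts[False])
        total = len(positives) + len(negatives)
        self.log_prior = {
            True: math.log(len(positives) / total),
            False: math.log(len(negatives) / total),
        }

    def _log_score(self, tokens, label):
        counts = self.counts[label]
        denom = sum(counts.values()) + len(self.vocab)  # Laplace smoothing
        score = self.log_prior[label]
        for tok in tokens:
            if tok in self.vocab:  # tokens never seen in training are ignored
                score += math.log((counts[tok] + 1) / denom)
        return score

    def matches(self, text):
        tokens = tokenize(text)
        return self._log_score(tokens, True) > self._log_score(tokens, False)


# Steps 1 and 2: hypothetical categories, each with its own training set
# of (positive examples, negative examples).
ORDINARY_TEXT = [
    "Lunch is at noon in the cafeteria",
    "Please review the meeting agenda",
    "The weather is nice today",
]
TRAINING_SETS = {
    "source_code": (
        [
            "def main(): return 0",
            "import os; print(os.getcwd())",
            "for (int i = 0; i < n; i++) { sum += i; }",
        ],
        ORDINARY_TEXT,
    ),
    "financial": (
        [
            "Quarterly revenue forecast and profit margin",
            "Unreleased earnings report for the fiscal quarter",
            "Budget projections and revenue targets",
        ],
        ORDINARY_TEXT,
    ),
}

# Hypothetical DLP policy: the action to take for each detected category.
POLICIES = {"source_code": "block", "financial": "quarantine"}


def train(training_sets):
    """Step 3: train one classifier per category."""
    return {cat: CategoryClassifier(pos, neg)
            for cat, (pos, neg) in training_sets.items()}


def scan(text, classifiers, policies):
    """Step 4: return the policy action for every category the text matches."""
    return {cat: policies[cat]
            for cat, clf in classifiers.items() if clf.matches(text)}


classifiers = train(TRAINING_SETS)
print(scan("import sys; def run(): return 1", classifiers, POLICIES))
print(scan("Revenue forecast for the quarter", classifiers, POLICIES))
```

Training one binary classifier per category is only one possible reading of "at least one machine learning-based classifier"; a single multi-class model would fit the wording equally well. The toy training sets are far too small for real use and serve only to make the flow from examples to policy action concrete.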

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.