Patent · US Active

Identification and classification of sensitive information in data catalog objects

US12038948B2 · kind B2 · utility

0Cited by
1References
21Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 22, 2021
Grant dateJul 16, 2024
Priority date
Expiry dateDec 22, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F21/6254
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A data catalog system is described that includes capabilities for automatically identifying and classifying sensitive information stored in data objects associated with various data sources. The data catalog system identifies a data object associated with a data asset stored in a data catalog metadata repository and computes a sensitivity score for the data object based on a set of one or more sensitive data identification techniques. The system determines a set of enrichment labels for the data object based on the sensitivity score computed for the data object. The enrichment labels are used to further qualify, enrich, or classify the data objects identified as containing sensitive information. For instance, the enrichment labels may identify a set of custom properties to be assigned to a data object, identify glossary terms to be applied to the data object or the enrichment labels may identify tags to be assigned to the data object.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.