Patent · US Active

Systems and methods for detecting personally identifiable information

US11755766B2 · kind B2 · utility

3Cited by
0References
19Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 15, 2020
Grant dateSep 12, 2023
Priority date
Expiry dateJan 25, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N20/00
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The disclosure generally relates to systems and methods for detecting personally identifiable information (PII). The present systems and methods solve the problem of detecting the PII and the PII column names in the customer database with enhanced accuracy, by developing a PII classification model trained with an enhanced and effective training dataset. An enhanced sub-metadata from the metadata having the plurality of the column names is obtained by using highest match distance values, the string comparator values, and the is PII indicator values. The enhanced sub-metadata comprising the column names that can be easily differentiated as PII columns or non-PII columns. Hence the training dataset and the testing dataset obtained from the enhanced sub-metadata improves the accuracy of the PII classification model. Preventive measures can be taken to protect such detected PII present under the PII columns by employing various data privacy and protection techniques.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.