Patent · US Active

Identifying personally identifiable information within an unstructured data store

US11263341B1 · kind B1 · utility

2Cited by
1References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 11, 2018
Grant dateMar 1, 2022
Priority date
Expiry dateMay 10, 2040

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F21/6245
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods and systems for identifying personally identifiable information (PII) are disclosed. In some aspects, frequency maps of fields storing known PII information are generated. The frequency maps may count occurrences of unique bigrams in the PII fields. A field of interest may then be analyzed to generate a second frequency map. Correlations between the first frequency maps and the second frequency map may be generated. If one of the correlations meets certain criterion, the disclosed embodiments may determine that the field of interest does or does not include PII. Access control for the field of interest may then be based on whether the field includes PII. In some aspects, a storage location of data included in the field of interest may be based on whether the field includes PII.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.