Identifying personally identifiable information within an unstructured data store
US11263341B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 11, 2018 |
| Grant date | Mar 1, 2022 |
| Priority date | — |
| Expiry date | May 10, 2040 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F21/6245
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Methods and systems for identifying personally identifiable information (PII) are disclosed. In some aspects, frequency maps of fields storing known PII information are generated. The frequency maps may count occurrences of unique bigrams in the PII fields. A field of interest may then be analyzed to generate a second frequency map. Correlations between the first frequency maps and the second frequency map may be generated. If one of the correlations meets certain criterion, the disclosed embodiments may determine that the field of interest does or does not include PII. Access control for the field of interest may then be based on whether the field includes PII. In some aspects, a storage location of data included in the field of interest may be based on whether the field includes PII.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.