Electronic medical record datasifter
US10776516B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Aug 1, 2018 |
| Grant date | Sep 15, 2020 |
| Priority date | — |
| Expiry date | Mar 16, 2039 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06N20/20
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method is presented for generating a data set from a database. The method involves iterative data manipulation that stochastically identifies candidate entries from the cases (subjects, participants) and variables (data elements) and subsequently selects, nullifies, and imputes the information. This process relies heavily on statistical multivariate imputation to preserve the joint distributions of the complex structured data archive. At each step, the algorithm generates a complete data set that, in aggregate, closely resembles the intrinsic characteristics of the original data set; on an individual level, however, the rows of data are substantially altered. This procedure drastically reduces the risk of subject reidentification by stratification, as metadata for all subjects is repeatedly and lossily encoded.
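The select, nullify, and impute loop described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the function name `sift` and the parameters `fraction`, `iterations`, and `k` are hypothetical, the data is assumed to be purely numeric, and a simple nearest-neighbour hot-deck imputation stands in for the statistical multivariate imputation the abstract refers to.

```python
import random


def sift(rows, fraction=0.3, iterations=3, k=3, seed=0):
    """Return an obfuscated copy of `rows` (a list of equal-length lists of floats).

    Hypothetical sketch: needs at least two rows. Imputation is a
    nearest-neighbour hot-deck, a stand-in for multivariate imputation.
    """
    rng = random.Random(seed)
    data = [list(r) for r in rows]          # never mutate the caller's data
    n_rows, n_cols = len(data), len(data[0])
    all_cells = [(i, j) for i in range(n_rows) for j in range(n_cols)]
    n_cells = max(1, int(fraction * len(all_cells)))

    for _ in range(iterations):
        # Complete data set produced by the previous step; used as donor pool.
        prev = [list(r) for r in data]

        # 1. Stochastically select candidate (case, variable) entries.
        cells = rng.sample(all_cells, n_cells)
        masked = set(cells)

        # 2. Nullify the selected entries.
        for i, j in cells:
            data[i][j] = None

        # 3. Impute each nullified entry from one of the k most similar cases,
        #    where similarity uses only the variables still observed in row i.
        for i, j in cells:
            observed = [c for c in range(n_cols) if (i, c) not in masked]

            def dist(d, i=i, observed=observed):
                return sum((prev[i][c] - prev[d][c]) ** 2 for c in observed)

            donors = sorted((d for d in range(n_rows) if d != i), key=dist)[:k]
            data[i][j] = prev[rng.choice(donors)][j]

    return data
```

Each pass ends with a complete data set, matching the abstract's description. Because imputed values are borrowed from similar cases, per-column value ranges are retained while individual rows drift away from the originals as the iterations accumulate. Raising `fraction` or `iterations` in this sketch trades fidelity for stronger obfuscation.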
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.