Patent · US Active

Privacy driven data subset sizing

US12411988B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Aug 18, 2023
Grant date: Sep 9, 2025
Priority date
Expiry date: Mar 20, 2044

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V10/774
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Techniques are described for maintaining patient privacy in association with obtaining patient data for machine learning applications. In an example, a method can comprise accessing, by a system comprising a processor, a training dataset associated with a machine learning model, the training dataset comprising data samples respectively comprising unique characteristics of subjects. The method further comprises determining, by the system, characteristic curve information that correlates different data portions extracted from the data samples to respective probabilities of matching the different data portions to respective ones of the data samples from which they are extracted. The method further comprises controlling, by the system, collection of new data portions extracted from new data samples corresponding to the data samples based on the new data portions conforming to criteria that define a target data portion, wherein the criteria comprise a probability of the respective probabilities that satisfies an anonymity criterion.
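
The selection step in the abstract can be illustrated with a minimal sketch. It is not the patented method. It assumes, for illustration only, that each data portion is described by a single size value, that the characteristic curve is available as (size, match probability) pairs, and that the anonymity criterion is a maximum acceptable match probability. The function names, the example curve values, and the 0.05 threshold are all hypothetical and do not come from the patent.

```python
from typing import Optional, Sequence, Tuple

# Hypothetical characteristic curve: (portion size, probability that a portion
# of that size can be matched back to the sample it was extracted from).
# These numbers are invented for illustration only.
example_curve = [(16, 0.01), (32, 0.04), (64, 0.20), (128, 0.75)]


def select_target_portion_size(
    curve: Sequence[Tuple[int, float]],
    max_match_probability: float,
) -> Optional[int]:
    """Return the largest portion size whose match probability is at or
    below the assumed anonymity threshold, or None if no size qualifies."""
    eligible = [size for size, prob in curve if prob <= max_match_probability]
    return max(eligible) if eligible else None


def conforms(portion_size: int, target_size: Optional[int]) -> bool:
    """Assumed collection rule: accept a new portion only if a target size
    exists and the portion is no larger than that target."""
    return target_size is not None and portion_size <= target_size


# Assumed anonymity criterion: match probability of at most 5%.
target = select_target_portion_size(example_curve, max_match_probability=0.05)
print(target)                # largest size meeting the threshold
print(conforms(24, target))  # a smaller portion would be collected
print(conforms(64, target))  # a larger portion would be rejected
```

Choosing the largest qualifying size is one possible design choice: it keeps as much data per sample as the assumed threshold allows. The abstract itself only requires that the target data portion correspond to a probability that satisfies the anonymity criterion.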

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.