Patent · US Active

Overcoming data missingness for improving predictions

US12198025B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 30
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jul 29, 2022
Grant date: Jan 14, 2025
Priority date
Expiry date: Jul 29, 2042

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N20/10
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Disclosed herein are methods for training and deploying a predictive model for generating a prediction, e.g., patient eligibility for a CAR-T therapy. Datasets, such as open healthcare claims datasets, may be missing data. Missing data may hamper the ability to generate sufficient information needed for training a predictive model. Methods include leveraging comprehensive datasets, such as closed claims datasets, to create training examples for input into a machine learning algorithm. In various embodiments, the comprehensive dataset is modified to simulate the data missingness in the target dataset; then, the modified dataset is paired with the ground truth label derived from the comprehensive dataset to create training examples. In various embodiments, a comprehensive dataset is paired with a target dataset to create training examples. After training a predictive model on such examples, the model can be deployed to make predictions in the target dataset even in light of missing data.
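
The first embodiment in the abstract (degrade the comprehensive dataset so it resembles the target dataset, then pair it with a label derived from the complete record) can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and not taken from the patent: the record layout (`code`, `provider`), the per-provider `capture_rate` used to simulate missingness, the code-count features, and all function names are hypothetical.

```python
import random


def derive_label(claims, target_code):
    """Ground-truth label from the COMPLETE record: 1 if the target code appears."""
    return int(any(c["code"] == target_code for c in claims))


def simulate_missingness(claims, capture_rate, rng):
    """Keep each claim with the probability assigned to its provider.

    Hypothetical missingness model: an open dataset only captures claims
    routed through some providers, so unknown providers default to 0.0.
    """
    return [c for c in claims if rng.random() < capture_rate.get(c["provider"], 0.0)]


def featurize(claims, vocabulary):
    """Count how often each code in `vocabulary` occurs in the claims."""
    return [sum(1 for c in claims if c["code"] == code) for code in vocabulary]


def build_training_examples(patients, vocabulary, target_code, capture_rate, seed=0):
    """Pair features from the degraded record with the label from the full record."""
    rng = random.Random(seed)
    examples = []
    for claims in patients:
        label = derive_label(claims, target_code)           # uses complete data
        degraded = simulate_missingness(claims, capture_rate, rng)
        examples.append((featurize(degraded, vocabulary), label))
    return examples


# Illustrative (made-up) closed-claims histories for two patients.
patients = [
    [{"code": "dx1", "provider": "A"}, {"code": "tx9", "provider": "B"}],
    [{"code": "dx1", "provider": "A"}, {"code": "dx2", "provider": "A"}],
]
examples = build_training_examples(
    patients,
    vocabulary=["dx1", "dx2"],
    target_code="tx9",
    capture_rate={"A": 1.0, "B": 0.0},
)
```

The resulting `(features, label)` pairs could then be passed to any supervised learner. The point of the design is that the model only ever sees inputs with the same kind of gaps it will meet in the target dataset, while the label still reflects the complete record.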

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.