Patent · US Active

Preprocessing and imputing method for structural data

US11841839B1 · kind B1 · utility

0Cited by
0References
5Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 3, 2023
Grant dateDec 12, 2023
Priority date
Expiry dateMay 3, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N3/094
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The present invention discloses a preprocessing and imputing method for structural data, comprising: step 1, querying the missing information of an original data, counting missing values, and obtaining a missing rate for the original data; step 2, based on the missing rate, performing listwise deletion on the original data, and then traversing the rows to generate corresponding dichotomous arrays, converting the arrays to the form of histogram, calculating the maximum rectangular area formed by the corresponding histogram, and then sorting all rectangular areas to obtain the maximum complete information matrix; step 3, using multiple imputation by chained equations, auto-encoders, or generative adversarial imputation networks to impute missing values on the original data. The present invention can carry out missing information statistics on the original data, automatically search the maximum complete information that meets the conditions, impute the structural data, greatly improve the quality of the original dataset and convenience for subsequent prediction tasks.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.