Patent · US Active

Dataset quality for synthetic data generation in computer-based reasoning systems

US11640561B2 · kind B2 · utility

2Cited by
18References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 28, 2021
Grant dateMay 2, 2023
Priority date
Expiry dateMay 28, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N3/006
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Techniques for synthetic data generation in computer-based reasoning systems are discussed and include receiving a request for generation of synthetic data based on a set of training data cases. One or more focal training data cases are determined. For undetermined features (either all of them or those that are not subject to conditions), a value for the feature is determined based on the focal cases. In some embodiments, the generated synthetic data may be checked for similarity against the training data, and if similarity conditions are met, it may be modified (e.g., resampled), removed, and/or replaced.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.