Training sample set generation from imbalanced data in view of user goals
US11836219B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 3, 2021 |
| Grant date | Dec 5, 2023 |
| Priority date | — |
| Expiry date | Jun 15, 2042 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06N20/00
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
One embodiment provides a method, including: receiving a sample set for training a machine-learning model, wherein the sample set includes a plurality of classes, wherein classes within the plurality of classes have an imbalance in a number of samples; creating an enlarged minority class by generating new samples from the samples within the minority class and adding the new samples to the minority class; selecting subset samples from both the samples within the enlarged minority class and the majority class; weighting each of the subset samples based upon user input defining goals for attributes of a training sample set to be used in training the machine-learning model; and generating the training sample set by re-running the selecting in view of the weighting.
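The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. All function names are hypothetical. Linear interpolation between minority samples stands in for the unspecified sample-generation technique, the user goal is assumed to be a target share per class, and the weighted re-selection uses Efraimidis-Spirakis sampling keys as one possible choice.

```python
import random
from typing import Dict, List, Optional, Tuple

Sample = Tuple[float, ...]
Labeled = Tuple[Sample, str]

def enlarge_minority(minority: List[Sample], n_new: int, rng: random.Random) -> List[Sample]:
    """Add n_new synthetic samples, each interpolated between two minority samples."""
    synthetic = []
    for _ in range(n_new):
        a, b = rng.sample(minority, 2)  # needs at least two minority samples
        t = rng.random()
        synthetic.append(tuple(x + t * (y - x) for x, y in zip(a, b)))
    return minority + synthetic

def select_subset(items: List[Labeled], k: int, rng: random.Random,
                  weights: Optional[List[float]] = None) -> List[Labeled]:
    """Pick k items without replacement; higher weights are more likely to be kept."""
    if weights is None:
        return rng.sample(items, k)
    # Efraimidis-Spirakis: draw key u**(1/w) per item and keep the k largest keys.
    keyed = [(rng.random() ** (1.0 / w), i) for i, w in enumerate(weights)]
    keyed.sort(reverse=True)
    return [items[i] for _, i in keyed[:k]]

def goal_weights(items: List[Labeled], goals: Dict[str, float]) -> List[float]:
    """Assumed goal format: desired share per class. Weight = desired / observed share."""
    counts: Dict[str, int] = {}
    for _, label in items:
        counts[label] = counts.get(label, 0) + 1
    total = len(items)
    return [goals[label] / (counts[label] / total) for _, label in items]

def build_training_set(majority: List[Sample], minority: List[Sample],
                       goals: Dict[str, float], k: int, seed: int = 0) -> List[Labeled]:
    rng = random.Random(seed)
    # Step 1: enlarge the minority class (here: double it).
    enlarged = enlarge_minority(minority, n_new=len(minority), rng=rng)
    pool = [(x, "majority") for x in majority] + [(x, "minority") for x in enlarged]
    # Step 2: first, unweighted selection of candidate samples from both classes.
    candidates = select_subset(pool, min(len(pool), 3 * k), rng)
    # Step 3: weight each candidate according to the user-defined goals.
    weights = goal_weights(candidates, goals)
    # Step 4: re-run the selection in view of the weights.
    return select_subset(candidates, k, rng, weights)
```

In this sketch, calling `build_training_set(majority, minority, {"majority": 0.5, "minority": 0.5}, k=20)` tends to favor minority samples in the final set, because their observed share among the candidates is below the requested share. Goal shares must be positive for the weighted selection to be well defined.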
Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.