Patent · US Active

Predicting machine learning or deep learning model training time

US11429895B2 · kind B2 · utility

4Cited by

1References

20Claims

0Family size

Assignee

Oracle International Corporation · US

Inventors

Anatoly Yakovlev · Santa Clara, US
Venkatanathan Varadarajan · Seattle, US
Sandeep Agrawal · San Jose, US
Hesam Fathi Moghadam · Sunnyvale, US
Sam Idicula · Santa Clara, US
Nipun Agarwal · Santa Clara, US

Key dates

Filing date	Apr 15, 2019
Grant date	Aug 30, 2022
Priority date	—
Expiry date	Jul 1, 2040

Classification

Technology area (CPC G)Physics
CPC primaryG06N20/20
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Herein are techniques for exploring hyperparameters of a machine learning model (MLM) and to train a regressor to predict a time needed to train the MLM based on a hyperparameter configuration and a dataset. In an embodiment that is deployed in production inferencing mode, for each landmark configuration, each containing values for hyperparameters of a MLM, a computer configures the MLM based on the landmark configuration and measures time spent training the MLM on a dataset. An already trained regressor predicts time needed to train the MLM based on a proposed configuration of the MLM, dataset meta-feature values, and training durations and hyperparameter values of landmark configurations of the MLM. When instead in training mode, a regressor in training ingests a training corpus of MLM performance history to learn, by reinforcement, to predict a training time for the MLM for new datasets and/or new hyperparameter configurations.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.