Patent · US Active

Distributed training and prediction using elastic resources

US11003992B2 · kind B2 · utility

16Cited by

3References

20Claims

0Family size

Assignee

FACEBOOK, INC. · US

Inventors

Lukasz Wesolowski · Redwood City, US
Mohamed Fawzi Mokhtar Abd El Aziz · Kirkland, US
Aditya Rajkumar Kalro · Los Gatos, US
Hongzhong Jia · Mountain View, US
Jay Parikh · Redwood City, US

Key dates

Filing date	Oct 16, 2017
Grant date	May 11, 2021
Priority date	—
Expiry date	Mar 2, 2040

Classification

Technology area (CPC G)Physics
CPC primaryG06N3/105
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

In one embodiment, a method includes establishing access to first and second different computing systems. A machine learning model is assigned for training to the first computing system, and the first computing system creates a check-point during training in response to a first predefined triggering event. The check-point may be a record of an execution state in the training of the machine learning model by the first computing system. In response to a second predefined triggering event, the training of the machine learning model on the first computing system is halted, and in response to a third predefined triggering event, the training of the machine learning model is transferred to the second computing system, which continues training the machine learning model starting from the execution state recorded by the check-point.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.