Patent · US Active

Method, electronic device and computer readable medium for information processing for accelerating neural network training

US11640528B2 · kind B2 · utility

Cited by	0
References	0
Claims	10
Family size	0

Assignee

BAIDU USA LLC · US

Inventors

Zhiyu Cheng · Sunnyvale, US
Baopu Li · Santa Clara, US
Yingze Bao · Beijing, CN

Key dates

Filing date	Oct 22, 2019
Grant date	May 2, 2023
Priority date	—
Expiry date	Dec 24, 2041

Classification

Technology area (CPC G)	Physics
CPC primary	G06N20/00
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

A method for information processing for accelerating neural network training is disclosed. The method includes: acquiring a neural network corresponding to a deep learning task; and performing multiple iterations of training on the neural network based on a training data set. The training data set includes task data corresponding to the deep learning task. Each training iteration includes: processing the task data in the training data set using the current neural network, and determining, based on the network's processing result on the task data in the current iteration, a prediction loss for that iteration; determining a learning rate and a momentum for the current iteration; and updating the weight parameters of the current neural network by gradient descent, based on a preset weight decay and on the learning rate, the momentum, and the prediction loss of the current iteration. The method is intended to make deep learning-based neural network training efficient and low-cost.
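The iteration described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the learning rate and momentum are determined at each iteration, so the cosine-decay learning rate and linearly increasing momentum below are assumptions chosen only for illustration. The function names (`learning_rate_at`, `momentum_at`, `train`), the toy linear model, and all numeric defaults are likewise hypothetical and are not taken from the patent.

```python
import math


def learning_rate_at(step, total_steps, lr_max=0.1, lr_min=0.001):
    """Assumed schedule: cosine decay from lr_max toward lr_min."""
    cosine = 0.5 * (1.0 + math.cos(math.pi * step / total_steps))
    return lr_min + (lr_max - lr_min) * cosine


def momentum_at(step, total_steps, m_min=0.85, m_max=0.95):
    """Assumed schedule: linear ramp from m_min toward m_max."""
    return m_min + (m_max - m_min) * step / total_steps


def train(data, total_steps=500, weight_decay=1e-4):
    """Fit y = w*x + b with momentum gradient descent and a preset weight decay."""
    w, b = 0.0, 0.0      # weight parameters of the toy "network"
    vw, vb = 0.0, 0.0    # momentum (velocity) buffers
    n = len(data)
    losses = []
    for step in range(total_steps):
        # 1. Process the task data and compute the prediction loss (mean squared error).
        loss = grad_w = grad_b = 0.0
        for x, y in data:
            err = (w * x + b) - y
            loss += err * err / n
            grad_w += 2.0 * err * x / n
            grad_b += 2.0 * err / n
        losses.append(loss)

        # 2. Determine the learning rate and momentum for this iteration.
        lr = learning_rate_at(step, total_steps)
        mu = momentum_at(step, total_steps)

        # 3. Update the weights: the preset weight decay enters as an L2 term
        #    on the weight gradient (the bias is left undecayed here).
        grad_w += weight_decay * w
        vw = mu * vw - lr * grad_w
        vb = mu * vb - lr * grad_b
        w += vw
        b += vb
    return w, b, losses


if __name__ == "__main__":
    # Toy task data generated from y = 2x + 1.
    samples = [(x, 2.0 * x + 1.0) for x in (-1.0, -0.5, 0.0, 0.5, 1.0)]
    w, b, losses = train(samples)
    print(f"w={w:.4f} b={b:.4f} first_loss={losses[0]:.4f} last_loss={losses[-1]:.2e}")
```

In this sketch the weight decay stays fixed for the whole run while the learning rate and momentum are recomputed every iteration, mirroring the split the abstract draws between a "preset" weight decay and per-iteration values. Any other schedule could be substituted in the two helper functions without changing the loop.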

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.