Patent · US Active

Method, electronic device and computer readable medium for information processing for accelerating neural network training

US11640528B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 10
Family size: 0

Assignee

Inventors

Key dates

Filing date: Oct 22, 2019
Grant date: May 2, 2023
Priority date
Expiry date: Dec 24, 2041

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N20/00
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method for information processing for accelerating neural network training. The method includes: acquiring a neural network corresponding to a deep learning task; and performing multiple iterations of training on the neural network based on a training data set, where the training data set includes task data corresponding to the deep learning task. Each training iteration includes: processing the task data in the training data set using the current neural network, and determining a prediction loss for the current iteration based on the network's processing result on the task data; determining a learning rate and a momentum for the current iteration; and updating the weight parameters of the current neural network by gradient descent based on a preset weight decay together with the learning rate, the momentum, and the prediction loss of the current iteration. The method is intended to achieve efficient, low-cost training of deep-learning-based neural networks.
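
The training iteration described in the abstract can be illustrated with a minimal sketch in plain Python. It is not the patented implementation. The abstract does not say how the learning rate and momentum are determined at each iteration, so the cosine-shaped `lr_schedule` and `momentum_schedule` functions below are hypothetical placeholders, as are the toy linear model, the mean-squared-error loss, the L2 form of the weight decay, and all numeric constants.

```python
import math

def lr_schedule(step, total_steps, lr_max=0.1, lr_min=0.01):
    """Hypothetical cosine decay of the learning rate (an assumption, not from the abstract)."""
    cos = 0.5 * (1.0 + math.cos(math.pi * step / max(1, total_steps - 1)))
    return lr_min + (lr_max - lr_min) * cos

def momentum_schedule(step, total_steps, m_min=0.85, m_max=0.95):
    """Hypothetical momentum schedule that rises as the learning rate falls (an assumption)."""
    cos = 0.5 * (1.0 + math.cos(math.pi * step / max(1, total_steps - 1)))
    return m_max - (m_max - m_min) * cos

def predict(weights, x):
    """Toy linear model standing in for the neural network."""
    return sum(w * xi for w, xi in zip(weights, x))

def loss_and_grad(weights, batch):
    """Prediction loss (mean squared error) and its gradient w.r.t. the weights."""
    n = len(batch)
    loss = 0.0
    grad = [0.0] * len(weights)
    for x, y in batch:
        err = predict(weights, x) - y
        loss += err * err / n
        for j, xj in enumerate(x):
            grad[j] += 2.0 * err * xj / n
    return loss, grad

def train(data, num_weights, total_steps=200, weight_decay=1e-4):
    weights = [0.0] * num_weights
    velocity = [0.0] * num_weights
    losses = []
    for step in range(total_steps):
        # 1. Process the task data and determine the prediction loss.
        loss, grad = loss_and_grad(weights, data)
        # 2. Determine the learning rate and momentum for this iteration.
        lr = lr_schedule(step, total_steps)
        mom = momentum_schedule(step, total_steps)
        # 3. Gradient descent update using the preset weight decay
        #    (applied here as an L2 term added to the gradient), the
        #    learning rate, the momentum, and the loss gradient.
        for j in range(num_weights):
            g = grad[j] + weight_decay * weights[j]
            velocity[j] = mom * velocity[j] - lr * g
            weights[j] += velocity[j]
        losses.append(loss)
    return weights, losses

# Toy task data: y = 1 + 2x, with a constant feature for the bias term.
data = [([1.0, x / 10.0], 1.0 + 2.0 * (x / 10.0)) for x in range(-10, 11)]
weights, losses = train(data, num_weights=2)
```

On this toy data the learned weights should end close to the generating values (bias 1, slope 2), offset slightly by the weight decay term. Swapping in other schedule functions changes only step 2 of the loop and leaves the update rule in step 3 unchanged.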

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.