Patent · US Active

Information processing apparatus and information processing method

US11475292B2 · kind B2 · utility

0Cited by
0References
7Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 14, 2020
Grant dateOct 18, 2022
Priority date
Expiry dateJun 30, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N3/0985
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Each of a plurality of processors enters, to a model representing a neural network and including a common first weight, first data different from that used by the other processors, calculates an error gradient for the first weight, and integrates the gradients calculated by each processor. Each processor stores the first weight in a memory and updates the weight of the model to a second weight based on a hyperparameter value different from those used by the other processors, the integrated error gradient, and the first weight. Each processor enters common second data to the model, compares the evaluation results acquired by each processor, and selects a common hyperparameter value. Each processor updates the weight of the model to a third weight based on the selected hyperparameter value, the integrated error gradient, and the first weight stored in the memory.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.