Information processing apparatus and information processing method
US11475292B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 14, 2020 |
| Grant date | Oct 18, 2022 |
| Priority date | — |
| Expiry date | Jun 30, 2041 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N3/0985
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Each of a plurality of processors enters, to a model representing a neural network and including a common first weight, first data different from that used by the other processors, calculates an error gradient for the first weight, and integrates the gradients calculated by each processor. Each processor stores the first weight in a memory and updates the weight of the model to a second weight based on a hyperparameter value different from those used by the other processors, the integrated error gradient, and the first weight. Each processor enters common second data to the model, compares the evaluation results acquired by each processor, and selects a common hyperparameter value. Each processor updates the weight of the model to a third weight based on the selected hyperparameter value, the integrated error gradient, and the first weight stored in the memory.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.