Patent · US Active

Reinforcement learning algorithm search

US12430551B2 · kind B2 · utility

0Cited by
0References
19Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 3, 2021
Grant dateSep 30, 2025
Priority date
Expiry dateJun 6, 2044

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N3/092
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods, computer systems, and apparatus, including computer programs encoded on computer storage media, for generating and searching reinforcement learning algorithms. In some implementations, a computer-implemented system generates a sequence of candidate reinforcement learning algorithms. Each candidate reinforcement learning algorithm in the sequence is configured to receive an input environment state characterizing a state of an environment and to generate an output that specifies an action to be performed by an agent interacting with the environment. For each candidate reinforcement learning algorithm in the sequence, the system performs a performance evaluation for a set of a plurality of training environments. For each training environment, the system adjusts a set of environment-specific parameters of the candidate reinforcement learning algorithm by performing training of the candidate reinforcement learning algorithm to control a corresponding agent in the training environment. The system generates an environment-specific performance metric for the candidate reinforcement learning algorithm that measures a performance of the candidate reinforcement learning algorithm in …

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.