Patent · US Active

Adversarial reinforcement learning for procedural content generation and improved generalization

US11883746B2 · kind B2 · utility

0Cited by
36References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 17, 2021
Grant dateJan 30, 2024
Priority date
Expiry dateJan 30, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N3/044
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods, apparatus and systems are provided for training a first reinforcement-learning (RL) agent and a second RL agent coupled to a computer game environment using RL techniques. The first RL agent iteratively generates a sub-goal sequence in relation to an overall goal within the computer game environment, where the first RL agent generates a new sub-goal for the sub-goal sequence after a second RL agent, interacting with the computer game environment, successfully achieves a current sub-goal in the sub-goal sequence. The second RL agent iteratively interacts with the computer game environment to achieve the current sub-goal in which each iterative interaction includes an attempt by the second RL agent for interacting with the computer game environment to achieve the current sub-goal. The first RL agent is updated using a first reward issued when the second RL agent successfully achieves the current sub-goal. The second RL agent is updated when a second reward is issued by the computer game environment based on the performance of the second RL agent attempting to achieve said current sub-goal. Once validly trained, the first RL agent forms a final first RL agent for automatic pr…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.