Patent · US Active

Inverse reinforcement learning for user-specific behaviors

US11710072B1 · kind B1 · utility

0Cited by
0References
25Claims
0Family size

Inventors

Key dates

Filing dateJun 28, 2021
Grant dateJul 25, 2023
Priority date
Expiry dateJun 28, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06T2200/24
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

In one implementation, a method for inverse reinforcement learning for tailoring virtual agent behaviors to a specific user. The method includes: obtaining an initial behavior model for a virtual agent and an initial state for a virtual environment associated with the virtual agent, wherein the initial behavior model includes one or more tunable parameters; generating, based on the initial behavior model and the initial state for the virtual environment, a first set of behavioral trajectories for the virtual agent; obtaining a second set of behavioral trajectories from a source different from the initial behavior model; and generating an updated behavior model by adjusting at least one of the one or more tunable parameters of the initial behavior model as a function of the first and second sets of behavioral trajectories, wherein at least one of the first and second sets of behavioral trajectories are assigned different weights.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.