Patent · US Active

Robotic control using value distributions

US11571809B1 · kind B1 · utility

2Cited by
15References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 11, 2020
Grant dateFeb 7, 2023
Priority date
Expiry dateMay 11, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG05B2219/40499
  • WIPO fieldHandling
  • WIPO sectorMechanical engineering

Abstract

Techniques are described herein for robotic control using value distributions. In various implementations, as part of performing a robotic task, state data associated with the robot in an environment may be generated based at least in part on vision data captured by a vision component of the robot. A plurality of candidate actions may be sampled, e.g., from continuous action space. A trained critic neural network model that represents a learned value function may be used to process a plurality of state-action pairs to generate a corresponding plurality of value distributions. Each state-action pair may include the state data and one of the plurality of sampled candidate actions. The state-action pair corresponding to the value distribution that satisfies one or more criteria may be selected from the plurality of state-action pairs. The robot may then be controlled to implement the sampled candidate action of the selected state-action pair.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.