Patent · US Active

Fine-tuning policies to facilitate chaining

US12430564B2 · kind B2 · utility

0Cited by

3References

19Claims

0Family size

Assignee

NVIDIA Corporation · US

Inventors

Yuke Zhu · Stanford, US
Anima Anandkumar · Santa Clara, US
Youngwoon Lee · Los Angeles, US

Key dates

Filing date	Mar 1, 2022
Grant date	Sep 30, 2025
Priority date	—
Expiry date	Aug 1, 2044

Classification

Technology area (CPC G)Physics
CPC primaryG06N7/01
WIPO fieldControl
WIPO sectorInstruments

Abstract

A manipulation task may include operations performed by one or more manipulation entities on one or more objects. This manipulation task may be broken down into a plurality of sequential sub-tasks (policies). These policies may be fine-tuned so that a terminal state distribution of a given policy matches an initial state distribution of another policy that immediately follows the given policy within the plurality of policies. The fine-tuned plurality of policies may then be chained together and implemented within a manipulation environment.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.