Fine-tuning policies to facilitate chaining
US12430564B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 1, 2022 |
| Grant date | Sep 30, 2025 |
| Priority date | — |
| Expiry date | Aug 1, 2044 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N7/01
- WIPO fieldControl
- WIPO sectorInstruments
Abstract
A manipulation task may include operations performed by one or more manipulation entities on one or more objects. This manipulation task may be broken down into a plurality of sequential sub-tasks (policies). These policies may be fine-tuned so that a terminal state distribution of a given policy matches an initial state distribution of another policy that immediately follows the given policy within the plurality of policies. The fine-tuned plurality of policies may then be chained together and implemented within a manipulation environment.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.