Systems and methods for practical autonomy decision controller
US11107001B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 26, 2018 |
| Grant date | Aug 31, 2021 |
| Priority date | — |
| Expiry date | Dec 27, 2038 |
Classification
- Technology area (CPC B)Performing Operations; Transporting
- CPC primaryB64U2201/10
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A system includes a machine learning engine configured to receive training data including a plurality of input conditions associated with a state space and a plurality of response maneuvers associated with the state space and train a learning system using the training data and a reward function including a plurality of terms associated with a plurality of end state spaces, each term in the plurality of terms defines an end reward value for each end state space. A value function and policy are generated. The value function comprising a plurality of values, wherein each response maneuvers in the plurality of response maneuvers is associated with a value in the plurality of values related to transitioning from the state space to each end state space, the policy indicative of connections between the state spaces, plurality of values, and the respective end reward value for the plurality of end state spaces.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.