Method and system for learning reward functions for driving using positive-unlabeled reward learning
US12354415B2 · kind B2 · utility
0Cited by
1References
16Claims
0Family size
Assignee
Inventors
Key dates
| Filing date | Jan 27, 2022 |
| Grant date | Jul 8, 2025 |
| Priority date | — |
| Expiry date | Nov 10, 2043 |
Classification
- Technology area (CPC B)Performing Operations; Transporting
- CPC primaryB60W2555/00
- WIPO fieldControl
- WIPO sectorInstruments
Abstract
A method includes receiving first driving data associated with a first vehicle, receiving second driving data associated with one or more vehicles around the first vehicle, creating training data by labeling the first driving data as positive data and treating the second driving data as unlabeled, and using the training data to train a classifier to predict whether driving data input to the classifier is positive or unlabeled.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.