Patent · US Active

Method and system for learning reward functions for driving using positive-unlabeled reward learning

US12354415B2 · kind B2 · utility

0Cited by
1References
16Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJan 27, 2022
Grant dateJul 8, 2025
Priority date
Expiry dateNov 10, 2043

Classification

  • Technology area (CPC B)Performing Operations; Transporting
  • CPC primaryB60W2555/00
  • WIPO fieldControl
  • WIPO sectorInstruments

Abstract

A method includes receiving first driving data associated with a first vehicle, receiving second driving data associated with one or more vehicles around the first vehicle, creating training data by labeling the first driving data as positive data and treating the second driving data as unlabeled, and using the training data to train a classifier to predict whether driving data input to the classifier is positive or unlabeled.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.