Patent · US Active

Method and system for learning reward functions for driving using positive-unlabeled reward learning

US12354415B2 · kind B2 · utility

0Cited by

1References

16Claims

0Family size

Assignee

TOYOTA RESEARCH INSTITUTE, INC. · US

Inventors

Blake Warren Wulfe · San Francisco, US
Adrien David Gaidon · Mountain View, US

Key dates

Filing date	Jan 27, 2022
Grant date	Jul 8, 2025
Priority date	—
Expiry date	Nov 10, 2043

Classification

Technology area (CPC B)Performing Operations; Transporting
CPC primaryB60W2555/00
WIPO fieldControl
WIPO sectorInstruments

Abstract

A method includes receiving first driving data associated with a first vehicle, receiving second driving data associated with one or more vehicles around the first vehicle, creating training data by labeling the first driving data as positive data and treating the second driving data as unlabeled, and using the training data to train a classifier to predict whether driving data input to the classifier is positive or unlabeled.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.