METHOD AND SYSTEM FOR LEARNING REWARD FUNCTIONS FOR DRIVING USING POSITIVE-UNLABELED REWARD LEARNING
DRIVE
July 27, 2023
A method includes receiving first driving data associated with a first vehicle, receiving second driving data associated with one or more vehicles around the first vehicle, creating training data by labeling the first driving data as positive data and treating the second driving data as unlabeled, and using the training data to train a classifier to predict whether driving data input to the classifier is positive or unlabeled.
Discussion in the ATmosphere