Modeling Human Driving Behavior Through Generative Adversarial Imitation Learning

Raunak Bhattacharyya; Blake Wulfe; Derek J. Phillips; Alex Kuefler; Jeremy Morton; Ransalu Senanayake; Mykel J. Kochenderfer

doi:10.1109/tits.2022.3227738

ScienceGate Book Chapters

JOURNAL ARTICLE

Modeling Human Driving Behavior Through Generative Adversarial Imitation Learning

Raunak Bhattacharyya Blake Wulfe Derek J. Phillips Alex Kuefler Jeremy Morton Ransalu Senanayake Mykel J. Kochenderfer

Year: 2022 Journal: IEEE Transactions on Intelligent Transportation Systems Vol: 24 (3)Pages: 2874-2887 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/tits.2022.3227738

Get Full-Text PDF Get Analytical Report

Abstract

An open problem in autonomous vehicle safety validation is building reliable\nmodels of human driving behavior in simulation. This work presents an approach\nto learn neural driving policies from real world driving demonstration data. We\nmodel human driving as a sequential decision making problem that is\ncharacterized by non-linearity and stochasticity, and unknown underlying cost\nfunctions. Imitation learning is an approach for generating intelligent\nbehavior when the cost function is unknown or difficult to specify. Building\nupon work in inverse reinforcement learning (IRL), Generative Adversarial\nImitation Learning (GAIL) aims to provide effective imitation even for problems\nwith large or continuous state and action spaces, such as modeling human\ndriving. This article describes the use of GAIL for learning-based driver\nmodeling. Because driver modeling is inherently a multi-agent problem, where\nthe interaction between agents needs to be modeled, this paper describes a\nparameter-sharing extension of GAIL called PS-GAIL to tackle multi-agent driver\nmodeling. In addition, GAIL is domain agnostic, making it difficult to encode\nspecific knowledge relevant to driving in the learning process. This paper\ndescribes Reward Augmented Imitation Learning (RAIL), which modifies the reward\nsignal to provide domain-specific knowledge to the agent. Finally, human\ndemonstrations are dependent upon latent factors that may not be captured by\nGAIL. This paper describes Burn-InfoGAIL, which allows for disentanglement of\nlatent variability in demonstrations. Imitation learning experiments are\nperformed using NGSIM, a real-world highway driving dataset. Experiments show\nthat these modifications to GAIL can successfully model highway driving\nbehavior, accurately replicating human demonstrations and generating realistic,\nemergent behavior in the traffic flow arising from the interaction between\ndriving agents.\n

Keywords:

Imitation Reinforcement learning Computer science Artificial intelligence Machine learning ENCODE Generative grammar Domain (mathematical analysis) Action (physics) Adversarial system Human–computer interaction

Metrics

108

Cited By

10.20

FWCI (Field Weighted Citation Impact)

Refs

0.98

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Autonomous Vehicle Technology and Safety

Physical Sciences → Engineering → Automotive Engineering

Traffic control and management

Physical Sciences → Engineering → Control and Systems Engineering

Traffic and Road Safety

Physical Sciences → Engineering → Safety, Risk, Reliability and Quality

Modeling Human Driving Behavior Through Generative Adversarial Imitation Learning

Abstract

Metrics

Citation History

Topics

Related Documents

Modelling Human Driving Behaviour Through Generative Adversarial Imitation Learning

Modelling Human Driving Behaviour Through Generative Adversarial Imitation Learning

Customized Generative Adversarial Imitation Learning for Driving Behavior Modeling in Traffic Simulation

Generative Adversarial Imitation Learning from Human Behavior with Reward Shaping

Modeling Car-Following Behaviors and Driving Styles with Generative Adversarial Imitation Learning