arxiv:2406.15349

NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking

Published on Jun 21

· Submitted by

kashyap7x on Jun 24

Upvote

Authors:

Daniel Dauner ,

Marcel Hallgarten ,

Zetong Yang ,

Igor Gilitschenski ,

Andreas Geiger ,

Kashyap Chitta

Abstract

Benchmarking vision-based driving policies is challenging. On one hand, open-loop evaluation with real data is easy, but these results do not reflect closed-loop performance. On the other, closed-loop evaluation is possible in simulation, but is hard to scale due to its significant computational demands. Further, the simulators available today exhibit a large domain gap to real data. This has resulted in an inability to draw clear conclusions from the rapidly growing body of research on end-to-end autonomous driving. In this paper, we present NAVSIM, a middle ground between these evaluation paradigms, where we use large datasets in combination with a non-reactive simulator to enable large-scale real-world benchmarking. Specifically, we gather simulation-based metrics, such as progress and time to collision, by unrolling bird's eye view abstractions of the test scenes for a short simulation horizon. Our simulation is non-reactive, i.e., the evaluated policy and environment do not influence each other. As we demonstrate empirically, this decoupling allows open-loop metric computation while being better aligned with closed-loop evaluations than traditional displacement errors. NAVSIM enabled a new competition held at CVPR 2024, where 143 teams submitted 463 entries, resulting in several new insights. On a large set of challenging scenarios, we observe that simple methods with moderate compute requirements such as TransFuser can match recent large-scale end-to-end driving architectures such as UniAD. Our modular framework can potentially be extended with new datasets, data curation strategies, and metrics, and will be continually maintained to host future challenges. Our code is available at https://github.com/autonomousvision/navsim.

View arXiv page View PDF Add to collection

Community

kashyap7x

Paper author Paper submitter Jun 24

Which trajectory is best? Surprisingly, red (colliding with the vehicle on the right) has the lowest displacement error w.r.t. the human driver’s reference behavior! Our simulator NAVSIM helps address these complexities in autonomous vehicle benchmarking. It was designed for the CVPR autonomous grand challenge, with an official HuggingFace evaluation server: https://huggingface.co/spaces/AGC2024-P/e2e-driving-2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Models citing this paper 1

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2406.15349 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2406.15349 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.