DriveEnv-NeRF

Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation


Mu-Yi Shen1*, Chia-Chi Hsu1*, Hao-Yu Hou1*, Yu-Chen Huang1*, Wei-Fang Sun2,
Chia-Che Chang3, Yu-Lun Liu4, Chun-Yi Lee1
1Elsa Lab, National Tsing Hua University, 2NVIDIA AI Technology Center, NVIDIA Corporation, 3MediaTek Inc., 4National Yang Ming Chiao Tung University, *Equal Contribution


Abstract

In this study, we introduce the DriveEnv-NeRF framework, which leverages Neural Radiance Fields (NeRF) to validate and faithfully forecast the efficacy of autonomous driving agents in a targeted real-world scene. Standard simulator-based rendering often fails to accurately reflect real-world performance due to the sim-to-real gap, i.e., the disparity between virtual simulations and real-world conditions. To mitigate this gap, we propose a workflow for building a high-fidelity simulation environment of the targeted real-world scene using NeRF. This approach can render realistic images from novel viewpoints and construct 3D meshes for emulating collisions. Validating these capabilities by comparing success rates in simulated and real environments demonstrates the benefit of using DriveEnv-NeRF as an indicator of real-world performance. Furthermore, the DriveEnv-NeRF framework can serve as a training environment for autonomous driving agents under various lighting conditions. This approach enhances the robustness of the agents and reduces performance degradation when they are deployed to the target real-world scene, compared with agents trained entirely with the standard simulator rendering pipeline.

Framework Overview

The overview figure presents our proposed framework, DriveEnv-NeRF, which is designed to create highly realistic 3D simulations of a targeted environment. We first capture videos of the target scene under various lighting conditions to obtain a comprehensive representation. These videos are processed to extract frames and to recover camera parameters using structure-from-motion. A NeRF model is then trained on these inputs to produce novel view renderings and 3D meshes of the scene, which serve as the basis for visual observations and physical interactions within the simulation environment. Appearance embeddings are incorporated into the NeRF model to enhance the realism and variability of the environment. Once DriveEnv-NeRF is built, a driving policy can be both evaluated and trained within it, enabling the prediction of real-world performance and the direct deployment of the trained policy to real-world scenarios.
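As a concrete illustration, below is a minimal sketch of this capture-to-simulation pipeline, assuming the open-source nerfstudio toolkit is installed; the file paths and the choice of the nerfacto model are illustrative assumptions, not the exact toolchain used in the paper.

# Minimal capture-to-simulation sketch, assuming the nerfstudio CLI.
# Paths and model choice (nerfacto) are illustrative assumptions.
import subprocess

SCENE = "data/target_scene"  # hypothetical output directory

# 1. Extract frames from the captured video and estimate camera
#    parameters via structure-from-motion.
subprocess.run(
    ["ns-process-data", "video",
     "--data", "captures/day.mp4",
     "--output-dir", SCENE],
    check=True,
)

# 2. Train a NeRF model on the processed frames.
subprocess.run(["ns-train", "nerfacto", "--data", SCENE], check=True)

# 3. Export a 3D mesh of the scene for emulating collisions
#    in the simulator.
subprocess.run(
    ["ns-export", "poisson",
     "--load-config", "outputs/target_scene/nerfacto/config.yml",
     "--output-dir", "meshes/target_scene"],
    check=True,
)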


Variable Lighting

Environments with various illuminations can be created by controlling the appearance embedding. Two example lighting transformations are shown below, from day to night and from cloudy to sunny; a code sketch of the underlying conditioning follows the examples.

Day → Night

Cloudy → Sunny
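Below is a minimal PyTorch sketch of how a per-image appearance embedding can condition a NeRF color head, in the spirit of NeRF-in-the-Wild; the class name, layer sizes, and embedding dimension are illustrative assumptions rather than the paper's exact architecture.

# Sketch of an appearance-conditioned color head (dimensions illustrative).
import torch
import torch.nn as nn

class AppearanceColorHead(nn.Module):
    def __init__(self, num_images: int, feat_dim: int = 256,
                 embed_dim: int = 32):
        super().__init__()
        # One learnable embedding per training image, capturing its lighting.
        self.appearance = nn.Embedding(num_images, embed_dim)
        self.mlp = nn.Sequential(
            nn.Linear(feat_dim + embed_dim, 128), nn.ReLU(),
            nn.Linear(128, 3), nn.Sigmoid(),  # RGB in [0, 1]
        )

    def forward(self, feats: torch.Tensor, image_ids: torch.Tensor):
        # feats: (N, feat_dim) geometry features from the NeRF body;
        # image_ids: (N,) index of the source image for each sample.
        embed = self.appearance(image_ids)  # (N, embed_dim)
        return self.mlp(torch.cat([feats, embed], dim=-1))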

Appearance Interpolation

Interpolating between two appearance embeddings produces transitional lighting; a brief sketch follows the slider below.

☀️ → 🌙
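A minimal sketch of this interpolation, assuming per-image embeddings stored in a PyTorch nn.Embedding table; the indices and dimensions are illustrative.

# Blending the embeddings of two captures yields intermediate lighting.
import torch
import torch.nn as nn

# Two learned appearance embeddings, e.g. index 0 = sunny, index 1 = night.
appearance = nn.Embedding(num_embeddings=2, embedding_dim=32)
e_sun = appearance(torch.tensor([0]))
e_moon = appearance(torch.tensor([1]))

t = 0.5  # slider position in [0, 1]; 0 = sunny, 1 = night
e_mix = torch.lerp(e_sun, e_moon, t)  # (1 - t) * e_sun + t * e_moon
# Rendering with e_mix in place of a learned embedding produces the
# transitional lighting shown by the slider.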


Sim2Real

In addition, DriveEnv-NeRF can serve as a training environment for RL driving agents that rely on raw image inputs alone. Training under varying lighting conditions enables the agents to adapt to different real-world scenarios, and in our experiments these agents can be deployed directly to the real world while maintaining high success rates. A sketch of the lighting-randomized training loop follows the examples below.

Day

Night
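The following is an illustrative sketch of the lighting-randomized training loop, assuming a hypothetical gym-style environment with a set_appearance method and a generic agent interface; none of these names come from the released code.

# Lighting randomization during RL training (env/agent API hypothetical).
import random
import torch

def sample_lighting(embed_day: torch.Tensor, embed_night: torch.Tensor):
    """Draw a random lighting condition by blending day/night embeddings."""
    return torch.lerp(embed_day, embed_night, random.random())

def train(agent, env, embed_day, embed_night, episodes: int = 1000):
    for _ in range(episodes):
        # Re-sample the scene's lighting each episode so the policy sees
        # the full range of conditions it may face at deployment time.
        env.set_appearance(sample_lighting(embed_day, embed_night))
        obs = env.reset()  # raw RGB image rendered by the NeRF
        done = False
        while not done:
            action = agent.act(obs)
            obs, reward, done, info = env.step(action)
            agent.observe(obs, reward, done)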

BibTeX


@inproceedings{shen2024driveenvnerf,
  title={DriveEnv-Ne{RF}: Exploration of A Ne{RF}-Based Autonomous Driving Environment for Real-World Performance Validation},
  author={Mu-Yi Shen and Chia-Chi Hsu and Hao-Yu Hou and Yu-Chen Huang and Wei-Fang Sun and Chia-Che Chang and Yu-Lun Liu and Chun-Yi Lee},
  booktitle={RoboNerF: 1st Workshop On Neural Fields In Robotics at ICRA 2024},
  year={2024},
  url={https://openreview.net/forum?id=zuLWciW556}
}

Open-Source Implementations

We have released a portion of our implementation code for Isaac Sim and Isaac ROS. Stay tuned for additional releases in the future.