Framework Overview
The overview figure presents our proposed framework, DriveEnv-NeRF, which is designed to create highly realistic 3D simulations of a given environment. Initially, we capture videos of the target scene under various lighting conditions to ensure a comprehensive representation. These videos are processed to extract frames and generate camera parameters using structure-from-motion techniques. Following this, a NeRF model is trained with these inputs to produce novel view renderings and 3D meshes of the scene. These elements serve as the basis for visual observation and physical interactions within the simulation environment. Appearance embeddings are incorporated into the NeRF model to enhance the realism and variability of the environment. With DriveEnv-NeRF developed, the framework allows for the full evaluation and training of a driving policy within this environment. This framework enables the prediction of real-world performance and the application of the trained driving policy to real-world scenarios directly.