Awesome
<p align="center"> <h2 align="center">EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting</h2> <h5 align="center">International Conference on 3D Vision (3DV) 2025</h5> </p> <div align="center">Project Page | Paper | Video | Data
<img src="assets/egogs.gif"> </div>Updates
- <b>[12/01/2024]</b> Initial code release
📝 TODO List
- [x] Release code of EgoGaussian
- [x] Release 3DGS-ready egocentric data we processed from EPIC-KITCHENS, HOI4D, and EPIC Fields. Please also consider citing their great works if you use this subset 🤗
- [ ] Upload pre-trained checkpoints for quick evaluation and visualization
- [ ] EgoGaussian viewer
- [ ] Pipeline optimization
- [ ] Tutorial for running EgoGaussian on customized data
🛠️ Setup
The setup should be very similar to the original 3D Gaussian Splatting except we used a modified version of differential gaussian rasterization with support of depth & alpha rendering. We will release the requirements.txt
later.
Overview
The full EgoGaussian pipelie consists of 4 main stages corresponding to different scripts under trainers
- Static object & background initialization
- Coarse object pose estimation
- Fine-tuning object pose & shape
- Fine-tuning full dynamic scene
Quick start
You can use the following script to run a full EgoGaussian pipeline from scratch on the provided data.
sbatch train.sh
Reproducing the results
You can also skip the training and directly reproducing the results of Table 1 in our paper and videos on the webpage by running the following script with the checkpoints we provide.
DATA_TYPE=EK # or HOI
DATA_NAME=P03_03 # or Video0
RUN_NAME=full
python eval.py \
--source_path ${DATA_DIR}/${DATA_TYPE}/${DATA_NAME} \
--out_root ${OUT_DIR} \
--data_type ${DATA_TYPE} \
--video ${DATA_NAME} \
--run_name ${RUN_NAME} \
Acknowledgement
Our implementation is heavily based on the original 3D Gaussian Splatting. We thank the authors for their revolutionary work and open-source contributions.
Citation
If you find our paper useful, please cite us:
@misc{zhang2024egogaussiandynamicsceneunderstanding,
title={EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting},
author={Daiwei Zhang and Gengyan Li and Jiajie Li and Mickaël Bressieux and Otmar Hilliges and Marc Pollefeys and Luc Van Gool and Xi Wang},
year={2024},
eprint={2406.19811},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2406.19811},
}