Awesome
[ECCV'24] HAC
Official Pytorch implementation of HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression.
Compress 3D Gaussian Splatting for 75X without fidelity drop!
Yihang Chen, Qianyi Wu, Weiyao Lin, Mehrtash Harandi, Jianfei Cai
[Paper
] [Arxiv
] [Project
] [Github
]
Links
Welcome to check a series of works from our group on 3D radiance field representation compression as listed below:
- π CNC [CVPR'24] is now released for efficient NeRF compression! [
Paper
] [Arxiv
] [Project
] - π HAC [ECCV'24] is now released for efficient 3DGS compression! [
Paper
]Arxiv
] [Project
] - π FCGS [ARXIV'24] is now released for fast optimization-free 3DGS compression! [
Arxiv
] [Project
]
Updates
π₯8-Aug-2024: HAC now utilizes a cuda-based codec
instead of the original torchac
, which significantly reduces the codec runtime by over 10
times compared to that reported in the paper!
Overview
<p align="left"> <img src="assets/teaser.png" width=80% height=80% class="center"> </p>Our approach introduces a binary hash grid to establish continuous spatial consistencies, allowing us to unveil the inherent spatial relations of anchors through a carefully designed context model. To facilitate entropy coding, we utilize Gaussian distributions to accurately estimate the probability of each quantized attribute, where an adaptive quantization module is proposed to enable high-precision quantization of these attributes for improved fidelity restoration. Additionally, we incorporate an adaptive masking strategy to eliminate invalid Gaussians and anchors. Importantly, our work is the pioneer to explore context-based compression for 3DGS representation, resulting in a remarkable size reduction.
Performance
<p align="left"> <img src="assets/main_performance.png" width=80% height=80% class="center"> </p>Installation
We tested our code on a server with Ubuntu 20.04.1, cuda 11.8, gcc 9.4.0
- Unzip files
cd submodules
unzip diff-gaussian-rasterization.zip
unzip gridencoder.zip
unzip simple-knn.zip
unzip arithmetic.zip
cd ..
- Install environment
conda env create --file environment.yml
conda activate HAC_env
Data
First, create a data/
folder inside the project path by
mkdir data
The data structure will be organised as follows:
data/
βββ dataset_name
βΒ Β βββ scene1/
βΒ Β βΒ Β βββ images
βΒ Β βΒ Β βΒ Β βββ IMG_0.jpg
βΒ Β βΒ Β βΒ Β βββ IMG_1.jpg
βΒ Β βΒ Β βΒ Β βββ ...
βΒ Β βΒ Β βββ sparse/
βΒ Β βΒ Β βββ0/
βΒ Β βββ scene2/
βΒ Β βΒ Β βββ images
βΒ Β βΒ Β βΒ Β βββ IMG_0.jpg
βΒ Β βΒ Β βΒ Β βββ IMG_1.jpg
βΒ Β βΒ Β βΒ Β βββ ...
βΒ Β βΒ Β βββ sparse/
βΒ Β βΒ Β βββ0/
...
- For instance:
./data/blending/drjohnson/
- For instance:
./data/bungeenerf/amsterdam/
- For instance:
./data/mipnerf360/bicycle/
- For instance:
./data/nerf_synthetic/chair/
- For instance:
./data/tandt/train/
Public Data (We follow suggestions from Scaffold-GS)
- The BungeeNeRF dataset is available in Google Drive/ηΎεΊ¦η½η[ζεη :4whv].
- The MipNeRF360 scenes are provided by the paper author here. And we test on its entire 9 scenes
bicycle, bonsai, counter, garden, kitchen, room, stump, flowers, treehill
. - The SfM datasets for Tanks&Temples and Deep Blending are hosted by 3D-Gaussian-Splatting here. Download and uncompress them into the
data/
folder.
Custom Data
For custom data, you should process the image sequences with Colmap to obtain the SfM points and camera poses. Then, place the results into data/
folder.
Training
To train scenes, we provide the following training scripts:
- Tanks&Temples:
run_shell_tnt.py
- MipNeRF360:
run_shell_mip360.py
- BungeeNeRF:
run_shell_bungee.py
- Deep Blending:
run_shell_db.py
- Nerf Synthetic:
run_shell_blender.py
run them with
python run_shell_xxx.py
The code will automatically run the entire process of: training, encoding, decoding, testing.
- Training log will be recorded in
output.log
of the output directory. Results of detailed fidelity, detailed size, detailed time will all be recorded - Encoded bitstreams will be stored in
./bitstreams
of the output directory. - Evaluated output images will be saved in
./test/ours_30000/renders
of the output directory. - Optionally, you can change
lmbda
in theserun_shell_xxx.py
scripts to try variable bitrate. - After training, the original model
point_cloud.ply
is losslessly compressed as./bitstreams
. You should refer to./bitstreams
to get the final model size, but notpoint_cloud.ply
. You can even deletepoint_cloud.ply
if you like :).
Contact
- Yihang Chen: yhchen.ee@sjtu.edu.cn
Citation
If you find our work helpful, please consider citing:
@inproceedings{hac2024,
title={HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression},
author={Chen, Yihang and Wu, Qianyi and Lin, Weiyao and Harandi, Mehrtash and Cai, Jianfei},
booktitle={European Conference on Computer Vision},
year={2024}
}
LICENSE
Please follow the LICENSE of 3D-GS.
Acknowledgement
- We thank all authors from 3D-GS for presenting such an excellent work.
- We thank all authors from Scaffold-GS for presenting such an excellent work.