Home

Awesome

About

This repo contains the source code and results in the ICCV 2023 paper Computationally-Efficient Neural Image Compression with Shallow Decoders (formerly titled "Asymmetrically-powered Neural Image Compression with Shallow Decoders").

shallow ntc summarized in one picture

As far as we know, this is the first neural image compression method to operate with a decoding budget of less than 50K FLOPs/pixel while achieving rate-distortion performance competitive with BPG, when measured in PSNR and BPP.

UPDATE [Nov 1, 2023] Added pretrained checkpoints; download them here. See notebooks/vis_syn_filters.ipynb for an example for how to load a model and visualize the learned filters of the synthesis transform.

Software

Setup

Training

Here's an example command to train a factorized prior model, with default hyperparameter settings defined in a config file (run from the project root dir):

python -m factorized.train --config factorized/configs/bls2017.py --alsologtostderr

Here's another example for training the two-layer synthesis + ELIC analysis model:

python -m mshyper.train --config mshyper/configs/two_layer_syn.py --alsologtostderr

Here's an example command to launch Slurm training jobs with a hyperparameter sweep defined in the config file:

./launch.py --main mshyper.train --config mshyper/configs/two_layer_syn.py

By default, the experiment output will be written to project_dir/train_xms. Each experiment is identified by a unique id ("xid"; this is a string of number assigned by Slurm or a timestamp when not running with Slurm); within each experiment there can be multiple runs, each run on a different work unit with its own integer id ("wid"; this is simply the integer index of the hyperparameter setting being used in the hyperparameter sweep (defined by get_hyper in the config file)). The output of each work unit will be stored in project_dir/train_xms/xid/run_dir, where run_dir consists of a work unit id (like "wid=2") and a "runname" automatically generated from the hyperparameter setting (e.g., "num_filters=256-lr=0.001").

The logs and intermediate reconstructions can be viewed on tensorboard (just run the usual tensorboard --logdir your_workdir command).

Below is a list of model architectures from the paper and the corresponding "config" files for training experiments:

ModelConfig file
Ballé et al. 2017factorized/configs/bls2017.py
Minnen et al. 2018 (mean-scale Hyperprior)<br/>mshyper/configs/mbt2018.py
Proposed JPEG-like (one-layer) synthesis + ELIC analysismshyper/configs/jpegl.py
Proposed two-layer synthesis + ELIC analysismshyper/configs/two_layer_syn.py
No-residual two-layer synthesis + CNN analysis + mixed quantization training*mshyper/configs/two_layer_syn2.py

*In additional experiments we found this combination to give further improved R-D performance than the proposed two-layer synthesis + ELIC analysis.

Some pretrained checkpoints can be found here. See an example of working with a checkpoint here.

Evaluation

Here's an example command to launch Slurm jobs for evaluating trained checkpoints, on a dataset called $dataset_name:

./launch_eval.py --workdirs 'train_xms/188188/wid=*' --dataset $dataset_name

where train_xms/188188/wid=* is an example glob pattern that matches the workdirs of the runs to be evaluated. The results will be written to json_results/$dataset_name/end_to_end. Additional flags for eval.py can be passed to launch_eval.py via --args (e.g., controlling where results are stored and whether to skip eval when there are existing result files).

Iterative Inference

Here's an example command to launch Slurm jobs running iterative inference on pretrained checkpoints that use the hyperprior entropy model (substitute factorized for mshyper for models that use the factorized prior):

./launch.py --main mshyper.itinf --config mshyper/configs/itinf.py

This is basically the same as the train command; see details in the mshyper/configs/itinf.py config file.

The results will be stored in project_dir/itinf_xms/, with a similar naming convention/organization to the model training experiments.

You can find a demo of SGA here.