Home

Awesome

Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight Quantization of Large Language Models

Set up

Environment & Data: sh setup.sh

For math QA data, download the raw file from here and place it in the ./data/ folder

Set your path to HF cache directory in each of the bash files below. E.g.,

Usage

Running RTN-ada

Running GPTQ-ada

Citation

If you find AdaDim helpful or relevant, please kindly cite our paper:

@inproceedings{
heo2024rethinking,
title={Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight Quantization of Large Language Models},
author={Jung Hwan Heo and Jeonghoon Kim and Beomseok Kwon and Byeongwook Kim and Se Jung Kwon and Dongsoo Lee},
booktitle={The Twelfth International Conference on Learning Representations},
year={2024},
url={https://openreview.net/forum?id=JzG7kSpjJk}
}

Acknowledgements

This code base is expanded upon wonderful works from