<div align="center"> <h1>Awesome LLM Compression</h1> <a href="https://awesome.re"><img src="https://awesome.re/badge.svg"/></a> <img src="https://img.shields.io/github/stars/HuangOwen/Awesome-LLM-Compression.svg?style=social"> <img src="https://img.shields.io/github/watchers/HuangOwen/Awesome-LLM-Compression.svg?style=social"> </div>

A curated list of awesome LLM compression research papers and tools for accelerating LLM training and inference.

Contents

- Papers
  - Survey
  - Quantization
  - Pruning and Sparsity
  - Distillation
  - Efficient Prompting
  - KV Cache Compression
  - Other
- Tools

Contributing

This is an active repository, and your contributions are always welcome! Before you add papers or tools to the awesome list, please make sure that:

Thanks again to all the awesome contributors to this list!

<a href="https://github.com/HuangOwen/Awesome-LLM-Compression/graphs/contributors"><img src="https://contrib.rocks/image?repo=HuangOwen/Awesome-LLM-Compression&max=240&columns=12" /></a>

Star History

Star History Chart