Home

Awesome

<h1> Foundations of Large Language Model Compression—Part 1: Weight Quantization

arXiv License:MIT

<img width="783" alt="Screenshot 2024-09-04 at 9 15 57 AM" src="https://github.com/user-attachments/assets/6a3385e2-9c43-425d-b9db-914c11c85648"> </h1>

This is the official repo for the paper "Foundations of Large Language Model Compression—Part 1: Weight Quantization" by Sean I Young. Code will be available here soon.

<br/> <img width="783" alt="Screenshot 2024-09-04 at 9 09 48 AM" src="https://github.com/user-attachments/assets/ef9f6f0c-f32d-4f13-a951-36a7a043d974">