Home

Awesome

<p align="center" width="100%"> </p> <div id="top" align="center">

Knowledge Fusion of Large Language Models

<img src="https://img.shields.io/badge/Version-1.0.0-blue.svg" alt="Version"> <img src="https://img.shields.io/badge/License-Apache_2.0-green.svg" alt="License"> <img src="https://img.shields.io/github/stars/fanqiwan/FuseLLM?color=yellow" alt="Stars"> <img src="https://img.shields.io/github/issues/fanqiwan/FuseLLM?color=red" alt="Issues"> <h4> |<a href="https://arxiv.org/abs/2401.10491"> πŸ“‘ FuseLLM Paper @ICLR2024 </a> | <a href="https://arxiv.org/abs/2408.07990"> πŸ“‘ FuseChat Tech Report </a> | <a href="https://arxiv.org/abs/2412.03187"> πŸ“‘ WRPO Tech Report </a> | <a href="https://slit-ai.github.io/FuseChat-3.0/"> 🌐 Website </a> |

|<a href="https://huggingface.co/FuseAI"> πŸ€— HuggingFace Repo </a> | <a href="https://github.com/fanqiwan/FuseLLM"> 🐱 GitHub Repo </a> |

</h4> <p align="center"> <img src="./assets/logo.png" width="95%"> <br> </p> </div>

News

FuseChat-3.0 [SOTA 8B LLM on AlpacaEval-2 & Arena-Hard]

<p align="center"> <img src="./FuseChat-3.0/assets/FuseChat-3.0.png" width="60%"> <br> </p>

FuseChat [SOTA 7B LLM on MT-Bench]

<p align="center"> <img src="./FuseChat/assets/fig1.png" width="60%"> <br> </p> <p align="center"> <img src="./FuseChat/assets/tab1.png" width="95%"> <br> </p>

FuseLLM [Surpassing Llama-2-7B]

<p align="center"> <img src="./FuseLLM/assets/fig0.png" width="95%"> <br> </p> <p align="center"> <img src="./FuseLLM/assets/fig1.png" width="95%"> <br> </p>

Citation

Please cite the following paper if you reference our model, code, data, or paper related to FuseLLM.

@inproceedings{wan2024knowledge,
  title={Knowledge Fusion of Large Language Models},
  author={Fanqi Wan and Xinting Huang and Deng Cai and Xiaojun Quan and Wei Bi and Shuming Shi},
  booktitle={The Twelfth International Conference on Learning Representations},
  year={2024},
  url={https://openreview.net/pdf?id=jiDsk12qcz}
}

Please cite the following paper if you reference our model, code, data, or paper related to FuseChat.

@article{wan2024fusechat,
  title={FuseChat: Knowledge Fusion of Chat Models},
  author={Fanqi Wan and Longguang Zhong and Ziyi Yang and Ruijun Chen and Xiaojun Quan},
  journal={arXiv preprint arXiv:2408.07990},
  year={2024}
}

Please cite the following paper if you reference our model, code, data, or paper related to WRPO.

@article{yang2024wrpo,
  title={Weighted-Reward Preference Optimization for Implicit Model Fusion},
  author={Ziyi Yang and Fanqi Wan and Longguang Zhong and Tianyuan Shi and Xiaojun Quan},
  journal={arXiv preprint arXiv:2412.03187},
  year={2024}
}

Star History

Star History Chart