Home

Awesome

<div align="center"> <h1 align="center">The default model repository of <a href="https://github.com/bentoml/openllm">openllm</a></h1> </div>

This repo (on main branch) is already included by openllm by default.

If you want more up-to-date untested models, please add our nightly branch.

openllm repo add nightly https://github.com/bentoml/openllm-models@nightly

Supported Models

Table of Contents


Llama-3.1 <a id="llama3.1"></a>

ModelVersionHuggingface Link
llama3.1405b-instruct-awq-4bit-df2aHF Link
llama3.170b-instruct-awq-4bit-2988HF Link
llama3.170b-instruct-fp16-ace8HF Link
llama3.18b-instruct-awq-4bit-fe8cHF Link
llama3.18b-instruct-fp16-2f36HF Link

Llama-3 <a id="llama3"></a>

ModelVersionHuggingface Link
llama370b-instruct-awq-4bit-aebbHF Link
llama370b-instruct-fp16-1315HF Link
llama38b-instruct-awq-4bit-3f34HF Link
llama38b-instruct-fp16-8f83HF Link

Phi-3 <a id="phi3"></a>

ModelVersionHuggingface Link
phi33.8b-instruct-fp16-166cHF Link
phi33.8b-instruct-ggml-q4-76aaHF Link

Mistral <a id="mistral"></a>

ModelVersionHuggingface Link
mistral24b-instruct-nemo-ec54HF Link
mistral7b-instruct-awq-4bit-01cdHF Link
mistral7b-instruct-fp16-e1cdHF Link

Gemma-2 <a id="gemma2"></a>

ModelVersionHuggingface Link
gemma227b-instruct-fp16-6b83HF Link
gemma29b-instruct-fp16-6e86HF Link

Qwen-2 <a id="qwen2"></a>

ModelVersionHuggingface Link
qwen20.5b-instruct-fp16-33dfHF Link
qwen21.5b-instruct-fp16-7cdaHF Link
qwen257b-a14b-instruct-fp16-365fHF Link
qwen272b-instruct-awq-4bit-33faHF Link
qwen272b-instruct-fp16-8cb4HF Link
qwen27b-instruct-awq-4bit-14aaHF Link
qwen27b-instruct-fp16-bbf2HF Link

Gemma <a id="gemma"></a>

ModelVersionHuggingface Link
gemma2b-instruct-fp16-6ee7HF Link
gemma7b-instruct-awq-4bit-df0bHF Link
gemma7b-instruct-fp16-2297HF Link

Llama-2 <a id="llama2"></a>

ModelVersionHuggingface Link
llama213b-chat-fp16-ef61HF Link
llama270b-chat-fp16-16a0HF Link
llama27b-chat-awq-4bit-4f93HF Link
llama27b-chat-fp16-21b9HF Link

Mixtral <a id="mixtral"></a>

ModelVersionHuggingface Link
mixtral8x7b-instruct-v0.1-awq-4bit-06fdHF Link
mixtral8x7b-instruct-v0.1-fp16-e289HF Link

Mistral-Large <a id="mistral-large"></a>

ModelVersionHuggingface Link
mistral-large123b-instruct-awq-4bit-1d37HF Link
mistral-large123b-instruct-fp16-5c96HF Link

Codestral <a id="codestral"></a>

ModelVersionHuggingface Link
codestral22b-v0.1-fp16-b677HF Link