bert | bert-base-chinese | google-bert | bert-base-chinese | bert-base-chinese |
| chinese_L-12_H-768_A-12 | 谷歌 | tf权重<br>Tongjilibo/bert-chinese_L-12_H-768_A-12 | |
| chinese-bert-wwm-ext | HFL | hfl/chinese-bert-wwm-ext | hfl/chinese-bert-wwm-ext |
| bert-base-multilingual-cased | google-bert | bert-base-multilingual-cased | bert-base-multilingual-cased |
| MacBERT | HFL | hfl/chinese-macbert-base <br>hfl/chinese-macbert-large | hfl/chinese-macbert-base <br>hfl/chinese-macbert-large |
| WoBERT | 追一科技 | junnyu/wobert_chinese_base ,junnyu/wobert_chinese_plus_base | junnyu/wobert_chinese_base <br>junnyu/wobert_chinese_plus_base |
roberta | chinese-roberta-wwm-ext | HFL | hfl/chinese-roberta-wwm-ext <br>hfl/chinese-roberta-wwm-ext-large <br>(large的mlm权重是随机初始化) | hfl/chinese-roberta-wwm-ext <br>hfl/chinese-roberta-wwm-ext-large |
| roberta-small/tiny | 追一科技 | Tongjilibo/chinese_roberta_L-4_H-312_A-12 <br>Tongjilibo/chinese_roberta_L-6_H-384_A-12 | |
| roberta-base | FacebookAI | roberta-base | roberta-base |
| guwenbert | ethanyt | ethanyt/guwenbert-base | ethanyt/guwenbert-base |
albert | albert_zh<br>albert_pytorch | brightmart | voidful/albert_chinese_tiny <br>voidful/albert_chinese_small <br>voidful/albert_chinese_base <br>voidful/albert_chinese_large <br>voidful/albert_chinese_xlarge <br>voidful/albert_chinese_xxlarge | voidful/albert_chinese_tiny <br>voidful/albert_chinese_small <br>voidful/albert_chinese_base <br>voidful/albert_chinese_large <br>voidful/albert_chinese_xlarge <br>voidful/albert_chinese_xxlarge |
nezha | NEZHA<br>NeZha_Chinese_PyTorch | huawei_noah | sijunhe/nezha-cn-base <br>sijunhe/nezha-cn-large <br>sijunhe/nezha-base-wwm <br>sijunhe/nezha-large-wwm | sijunhe/nezha-cn-base <br>sijunhe/nezha-cn-large <br>sijunhe/nezha-base-wwm <br>sijunhe/nezha-large-wwm |
| nezha_gpt_dialog | bojone | Tongjilibo/nezha_gpt_dialog | |
xlnet | Chinese-XLNet | HFL | hfl/chinese-xlnet-base | hfl/chinese-xlnet-base |
| tranformer_xl | huggingface | transfo-xl/transfo-xl-wt103 | transfo-xl/transfo-xl-wt103 |
deberta | Erlangshen-DeBERTa-v2 | IDEA | IDEA-CCNL/Erlangshen-DeBERTa-v2-97M-Chinese <br>IDEA-CCNL/Erlangshen-DeBERTa-v2-320M-Chinese <br>IDEA-CCNL/Erlangshen-DeBERTa-v2-710M-Chinese | IDEA-CCNL/Erlangshen-DeBERTa-v2-97M-Chinese <br>IDEA-CCNL/Erlangshen-DeBERTa-v2-320M-Chinese <br>IDEA-CCNL/Erlangshen-DeBERTa-v2-710M-Chinese |
electra | Chinese-ELECTRA | HFL | hfl/chinese-electra-base-discriminator | hfl/chinese-electra-base-discriminator |
ernie | ernie | 百度文心 | nghuyong/ernie-1.0-base-zh <br>nghuyong/ernie-3.0-base-zh | nghuyong/ernie-1.0-base-zh <br>nghuyong/ernie-3.0-base-zh |
roformer | roformer | 追一科技 | junnyu/roformer_chinese_base | junnyu/roformer_chinese_base |
| roformer_v2 | 追一科技 | junnyu/roformer_v2_chinese_char_base | junnyu/roformer_v2_chinese_char_base |
simbert | simbert | 追一科技 | Tongjilibo/simbert-chinese-base <br>Tongjilibo/simbert-chinese-small <br>Tongjilibo/simbert-chinese-tiny | |
| simbert_v2/roformer-sim | 追一科技 | junnyu/roformer_chinese_sim_char_base ,junnyu/roformer_chinese_sim_char_ft_base ,junnyu/roformer_chinese_sim_char_small ,junnyu/roformer_chinese_sim_char_ft_small | junnyu/roformer_chinese_sim_char_base <br>junnyu/roformer_chinese_sim_char_ft_base <br>junnyu/roformer_chinese_sim_char_small <br>junnyu/roformer_chinese_sim_char_ft_small |
gau | GAU-alpha | 追一科技 | Tongjilibo/chinese_GAU-alpha-char_L-24_H-768 | |
uie | uie<br>uie_pytorch | 百度 | Tongjilibo/uie-base | |
gpt | CDial-GPT | thu-coai | thu-coai/CDial-GPT_LCCC-base <br>thu-coai/CDial-GPT_LCCC-large | thu-coai/CDial-GPT_LCCC-base <br>thu-coai/CDial-GPT_LCCC-large |
| cmp_lm(26亿) | 清华 | TsinghuaAI/CPM-Generate | TsinghuaAI/CPM-Generate |
| nezha_gen | huawei_noah | Tongjilibo/chinese_nezha_gpt_L-12_H-768_A-12 | |
| gpt2-chinese-cluecorpussmall | UER | uer/gpt2-chinese-cluecorpussmall | uer/gpt2-chinese-cluecorpussmall |
| gpt2-ml | imcaspar | torch<br>BaiduYun(84dh) | gpt2-ml_15g_corpus <br>gpt2-ml_30g_corpus |
bart | bart_base_chinese | 复旦fnlp | fnlp/bart-base-chinese <br>v1.0 | fnlp/bart-base-chinese <br>fnlp/bart-base-chinese-v1.0 |
t5 | t5 | UER | uer/t5-small-chinese-cluecorpussmall <br>uer/t5-base-chinese-cluecorpussmall | uer/t5-base-chinese-cluecorpussmall <br>uer/t5-small-chinese-cluecorpussmall |
| mt5 | 谷歌 | google/mt5-base | google/mt5-base |
| t5_pegasus | 追一科技 | Tongjilibo/chinese_t5_pegasus_small <br>Tongjilibo/chinese_t5_pegasus_base | |
| chatyuan | clue-ai | ClueAI/ChatYuan-large-v1 <br>ClueAI/ChatYuan-large-v2 | ClueAI/ChatYuan-large-v1 <br>ClueAI/ChatYuan-large-v2 |
| PromptCLUE | clue-ai | ClueAI/PromptCLUE-base | ClueAI/PromptCLUE-base |
chatglm | chatglm-6b | THUDM | THUDM/chatglm-6b <br>THUDM/chatglm-6b-int8 <br>THUDM/chatglm-6b-int4 <br>v0.1.0 | THUDM/chatglm-6b <br>THUDM/chatglm-6b-int8 <br>THUDM/chatglm-6b-int4 <br>THUDM/chatglm-6b-v0.1.0 |
| chatglm2-6b | THUDM | THUDM/chatglm2-6b <br>THUDM/chatglm2-6b-int4 <br>THUDM/chatglm2-6b-32k | THUDM/chatglm2-6b <br>THUDM/chatglm2-6b-int4 <br>THUDM/chatglm2-6b-32k |
| chatglm3-6b | THUDM | THUDM/chatglm3-6b <br>THUDM/chatglm3-6b-32k | THUDM/chatglm3-6b <br>THUDM/chatglm3-6b-32k |
| glm4-9b | THUDM | THUDM/glm-4-9b <br>THUDM/glm-4-9b-chat <br>THUDM/glm-4-9b-chat-1m | THUDM/glm-4-9b <br>THUDM/glm-4-9b-chat <br>THUDM/glm-4-9b-chat-1m |
llama | llama | meta | | meta-llama/llama-7b <br>meta-llama/llama-13b |
| llama-2 | meta | meta-llama/Llama-2-7b-hf<br>meta-llama/Llama-2-7b-chat-hf<br>meta-llama/Llama-2-13b-hf<br>meta-llama/Llama-2-13b-chat-hf | meta-llama/Llama-2-7b-hf <br>meta-llama/Llama-2-7b-chat-hf <br>meta-llama/Llama-2-13b-hf <br>meta-llama/Llama-2-13b-chat-hf |
| llama-3 | meta | meta-llama/Meta-Llama-3-8B <br>meta-llama/Meta-Llama-3-8B-Instruct | meta-llama/Meta-Llama-3-8B <br>meta-llama/Meta-Llama-3-8B-Instruct |
| llama-3.1 | meta | meta-llama/Meta-Llama-3.1-8B <br>meta-llama/Meta-Llama-3.1-8B-Instruct | meta-llama/Meta-Llama-3.1-8B <br>meta-llama/Meta-Llama-3.1-8B-Instruct |
| llama-3.2 | meta | meta-llama/Llama-3.2-1B <br>meta-llama/Llama-3.2-1B-Instruct <br>meta-llama/Llama-3.2-3B <br>meta-llama/Llama-3.2-3B-Instruct | meta-llama/Llama-3.2-1B <br>meta-llama/Llama-3.2-1B-Instruct <br>meta-llama/Llama-3.2-3B <br>meta-llama/Llama-3.2-3B-Instruct |
| Chinese-LLaMA-Alpaca | HFL | | hfl/chinese_alpaca_plus_7b <br>hfl/chinese_llama_plus_7b |
| Chinese-LLaMA-Alpaca-2 | HFL | | 待添加 |
| Chinese-LLaMA-Alpaca-3 | HFL | | 待添加 |
| Belle_llama | LianjiaTech | BelleGroup/BELLE-LLaMA-7B-2M-enc | 合成说明、BelleGroup/BELLE-LLaMA-7B-2M-enc |
| Ziya | IDEA-CCNL | IDEA-CCNL/Ziya-LLaMA-13B-v1<br>IDEA-CCNL/Ziya-LLaMA-13B-v1.1<br>IDEA-CCNL/Ziya-LLaMA-13B-Pretrain-v1 | IDEA-CCNL/Ziya-LLaMA-13B-v1 <br>IDEA-CCNL/Ziya-LLaMA-13B-v1.1 |
| vicuna | lmsys | lmsys/vicuna-7b-v1.5 | lmsys/vicuna-7b-v1.5 |
Baichuan | Baichuan | baichuan-inc | baichuan-inc/Baichuan-7B <br>baichuan-inc/Baichuan-13B-Base <br>baichuan-inc/Baichuan-13B-Chat | baichuan-inc/Baichuan-7B <br>baichuan-inc/Baichuan-13B-Base <br>baichuan-inc/Baichuan-13B-Chat |
| Baichuan2 | baichuan-inc | baichuan-inc/Baichuan2-7B-Base <br>baichuan-inc/Baichuan2-7B-Chat <br>baichuan-inc/Baichuan2-13B-Base <br>baichuan-inc/Baichuan2-13B-Chat | baichuan-inc/Baichuan2-7B-Base <br>baichuan-inc/Baichuan2-7B-Chat <br>baichuan-inc/Baichuan2-13B-Base <br>baichuan-inc/Baichuan2-13B-Chat |
Yi | Yi | 01-ai | 01-ai/Yi-6B <br>01-ai/Yi-6B-200K <br>01-ai/Yi-9B <br>01-ai/Yi-9B-200K | 01-ai/Yi-6B <br>01-ai/Yi-6B-200K <br>01-ai/Yi-9B <br>01-ai/Yi-9B-200K |
| Yi-1.5 | 01-ai | 01-ai/Yi-1.5-6B <br>01-ai/Yi-1.5-6B-Chat <br>01-ai/Yi-1.5-9B <br>01-ai/Yi-1.5-9B-32K <br>01-ai/Yi-1.5-9B-Chat <br>01-ai/Yi-1.5-9B-Chat-16K | 01-ai/Yi-1.5-6B <br>01-ai/Yi-1.5-6B-Chat <br>01-ai/Yi-1.5-9B <br>01-ai/Yi-1.5-9B-32K <br>01-ai/Yi-1.5-9B-Chat <br>01-ai/Yi-1.5-9B-Chat-16K |
bloom | bloom | bigscience | bigscience/bloom-560m <br>bigscience/bloomz-560m | bigscience/bloom-560m <br>bigscience/bloomz-560m |
Qwen | Qwen | 阿里云 | Qwen/Qwen-1_8B <br>Qwen/Qwen-1_8B-Chat <br>Qwen/Qwen-7B <br>Qwen/Qwen-7B-Chat <br>Qwen/Qwen-14B <br>Qwen/Qwen-14B-Chat | Qwen/Qwen-1_8B <br>Qwen/Qwen-1_8B-Chat <br>Qwen/Qwen-7B <br>Qwen/Qwen-7B-Chat <br>Qwen/Qwen-14B <br>Qwen/Qwen-14B-Chat |
| Qwen1.5 | 阿里云 | Qwen/Qwen1.5-0.5B <br>Qwen/Qwen1.5-0.5B-Chat <br>Qwen/Qwen1.5-1.8B <br>Qwen/Qwen1.5-1.8B-Chat <br>Qwen/Qwen1.5-7B <br>Qwen/Qwen1.5-7B-Chat <br>Qwen/Qwen1.5-14B <br>Qwen/Qwen1.5-14B-Chat | Qwen/Qwen1.5-0.5B <br>Qwen/Qwen1.5-0.5B-Chat <br>Qwen/Qwen1.5-1.8B <br>Qwen/Qwen1.5-1.8B-Chat <br>Qwen/Qwen1.5-7B <br>Qwen/Qwen1.5-7B-Chat <br>Qwen/Qwen1.5-14B <br>Qwen/Qwen1.5-14B-Chat |
| Qwen2 | 阿里云 | Qwen/Qwen2-0.5B <br>Qwen/Qwen2-0.5B-Instruct <br>Qwen/Qwen2-1.5B <br>Qwen/Qwen2-1.5B-Instruct <br>Qwen/Qwen2-7B <br>Qwen/Qwen2-7B-Instruct | Qwen/Qwen2-0.5B <br>Qwen/Qwen2-0.5B-Instruct <br>Qwen/Qwen2-1.5B <br>Qwen/Qwen2-1.5B-Instruct <br>Qwen/Qwen2-7B <br>Qwen/Qwen2-7B-Instruct |
| Qwen2-VL | 阿里云 | Qwen/Qwen2-VL-2B-Instruct <br>Qwen/Qwen2-VL-7B-Instruct | Qwen/Qwen2-VL-2B-Instruct <br>Qwen/Qwen2-VL-7B-Instruct |
| Qwen2.5 | 阿里云 | Qwen/Qwen2.5-0.5B <br>Qwen/Qwen2.5-0.5B-Instruct <br>Qwen/Qwen2.5-1.5B <br>Qwen/Qwen2.5-1.5B-Instruct <br>Qwen/Qwen2.5-3B <br>Qwen/Qwen2.5-3B-Instruct <br>Qwen/Qwen2.5-7B <br>Qwen/Qwen2.5-7B-Instruct <br>Qwen/Qwen2.5-14B <br>Qwen/Qwen2.5-14B-Instruct | Qwen/Qwen2.5-0.5B <br>Qwen/Qwen2.5-0.5B-Instruct <br>Qwen/Qwen2.5-1.5B <br>Qwen/Qwen2.5-1.5B-Instruct <br>Qwen/Qwen2.5-3B <br>Qwen/Qwen2.5-3B-Instruct <br>Qwen/Qwen2.5-7B <br>Qwen/Qwen2.5-7B-Instruct <br>Qwen/Qwen2.5-14B <br>Qwen/Qwen2.5-14B-Instruct |
InternLM | InternLM | 上海人工智能实验室 | internlm/internlm-7b <br>internlm/internlm-chat-7b | internlm/internlm-7b <br>internlm/internlm-chat-7b |
| InternLM2 | 上海人工智能实验室 | internlm/internlm2-1_8b <br>internlm/internlm2-chat-1_8b <br>internlm/internlm2-7b <br>internlm/internlm2-chat-7b <br>internlm/internlm2-20b <br>internlm/internlm2-chat-20b | internlm/internlm2-1_8b <br>internlm/internlm2-chat-1_8b <br>internlm/internlm2-7b <br>internlm/internlm2-chat-7b |
| InternLM2.5 | 上海人工智能实验室 | internlm/internlm2_5-7b <br>internlm/internlm2_5-7b-chat <br>internlm/internlm2_5-7b-chat-1m | internlm/internlm2_5-7b <br>internlm/internlm2_5-7b-chat <br>internlm/internlm2_5-7b-chat-1m |
Falcon | Falcon | tiiuae | tiiuae/falcon-rw-1b <br>tiiuae/falcon-7b <br>tiiuae/falcon-7b-instruct | tiiuae/falcon-rw-1b <br>tiiuae/falcon-7b <br>tiiuae/falcon-7b-instruct |
DeepSeek | DeepSeek-MoE | 深度求索 | deepseek-ai/deepseek-moe-16b-base <br>deepseek-ai/deepseek-moe-16b-chat | deepseek-ai/deepseek-moe-16b-base <br>deepseek-ai/deepseek-moe-16b-chat |
| DeepSeek-LLM | 深度求索 | deepseek-ai/deepseek-llm-7b-base <br>deepseek-ai/deepseek-llm-7b-chat | deepseek-ai/deepseek-llm-7b-base <br>deepseek-ai/deepseek-llm-7b-chat |
| DeepSeek-V2 | 深度求索 | deepseek-ai/DeepSeek-V2-Lite <br>deepseek-ai/DeepSeek-V2-Lite-Chat | deepseek-ai/DeepSeek-V2-Lite <br>deepseek-ai/DeepSeek-V2-Lite-Chat |
| DeepSeek-Coder | 深度求索 | deepseek-ai/deepseek-coder-1.3b-base <br>deepseek-ai/deepseek-coder-1.3b-instruct <br>deepseek-ai/deepseek-coder-6.7b-base <br>deepseek-ai/deepseek-coder-6.7b-instruct <br>deepseek-ai/deepseek-coder-7b-base-v1.5 <br>deepseek-ai/deepseek-coder-7b-instruct-v1.5 | deepseek-ai/deepseek-coder-1.3b-base <br>deepseek-ai/deepseek-coder-1.3b-instruct <br>deepseek-ai/deepseek-coder-6.7b-base <br>deepseek-ai/deepseek-coder-6.7b-instruct <br>deepseek-ai/deepseek-coder-7b-base-v1.5 <br>deepseek-ai/deepseek-coder-7b-instruct-v1.5 |
| DeepSeek-Coder-V2 | 深度求索 | deepseek-ai/DeepSeek-Coder-V2-Lite-Base <br>deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct | deepseek-ai/DeepSeek-Coder-V2-Lite-Base <br>deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct |
| DeepSeek-Math | 深度求索 | deepseek-ai/deepseek-math-7b-base <br>deepseek-ai/deepseek-math-7b-instruct <br>deepseek-ai/deepseek-math-7b-rl | deepseek-ai/deepseek-math-7b-base <br>deepseek-ai/deepseek-math-7b-instruct <br>deepseek-ai/deepseek-math-7b-rl |
MiniCPM | MiniCPM | OpenBMB | openbmb/MiniCPM-2B-sft-bf16 <br>openbmb/MiniCPM-2B-dpo-bf16 <br>openbmb/MiniCPM-2B-128k <br>openbmb/MiniCPM-1B-sft-bf16 | openbmb/MiniCPM-2B-sft-bf16 <br>openbmb/MiniCPM-2B-dpo-bf16 <br>openbmb/MiniCPM-2B-128k <br>openbmb/MiniCPM-1B-sft-bf16 |
| MiniCPM-V | OpenBMB | openbmb/MiniCPM-V-2_6 <br>openbmb/MiniCPM-Llama3-V-2_5 | openbmb/MiniCPM-V-2_6 <br>openbmb/MiniCPM-Llama3-V-2_5 |
embedding | text2vec-base-chinese | shibing624 | shibing624/text2vec-base-chinese | shibing624/text2vec-base-chinese |
| m3e | moka-ai | moka-ai/m3e-base | moka-ai/m3e-base |
| bge | BAAI | BAAI/bge-large-en-v1.5 <br>BAAI/bge-large-zh-v1.5 <br>BAAI/bge-base-en-v1.5 <br>BAAI/bge-base-zh-v1.5 <br>BAAI/bge-small-en-v1.5 <br>BAAI/bge-small-zh-v1.5 | BAAI/bge-large-en-v1.5 <br>BAAI/bge-large-zh-v1.5 <br>BAAI/bge-base-en-v1.5 <br>BAAI/bge-base-zh-v1.5 <br>BAAI/bge-small-en-v1.5 <br>BAAI/bge-small-zh-v1.5 |
| gte | thenlper | thenlper/gte-large-zh <br>thenlper/gte-base-zh | thenlper/gte-base-zh <br>thenlper/gte-large-zh |