Awesome

llm-resource（LLM 百宝箱）

LLM全栈优质资源汇总

非常欢迎大家也参与进来，收集更多优质大模型相关资源。

目录

🐼 LLM算法
🐘 LLM训练
- 🐘 LLM微调
- 🐼 LLM对齐
🔥 LLM推理
:palm_tree: LLM数据工程（Data Engineering）
📡 LLM压缩
🐰 LLM测评
🐘 AI基础知识
📡 AI基础设施
- :palm_tree: AI芯片
- 🐰 CUDA
🐘 AI编译器
🐰 AI框架
📡 LLM应用开发
🐘 LLMOps
📡 LLM实践
📡微信公众号文章集锦

LLM算法

Transformer

原理：

源码：

GPT1

GPT2

GPT2 源码：https://github.com/huggingface/transformers/blob/main/src/transformers/models/gpt2/modeling_gpt2.py
GPT2 源码解析：https://zhuanlan.zhihu.com/p/630970209
nanoGPT：https://github.com/karpathy/nanoGPT/blob/master/model.py
7.3 GPT2模型深度解析：http://121.199.45.168:13013/7_3.html
GPT（三）GPT2原理和代码详解: https://zhuanlan.zhihu.com/p/637782385
GPT2参数量剖析: https://zhuanlan.zhihu.com/p/640501114

ChatGPT

GLM

预训练语言模型：GLM

LLaMA

MOE 大模型

下一代大模型

多模态大模型

A Survey on Multimodal Large Language Models：https://arxiv.org/pdf/2306.13549 Efficient-Multimodal-LLMs-Survey：https://github.com/lijiannuist/Efficient-Multimodal-LLMs-Survey

LLM训练

LLM微调

Adapting P-Tuning to Solve Non-English Downstream Tasks

LLM对齐

LLM推理

使用HuggingFace的Accelerate库加载和运行超大模型 : device_map、no_split_module_classes、 offload_folder、 offload_state_dict
借助 PyTorch，Accelerate 如何运行超大模型
使用 DeepSpeed 和 Accelerate 进行超快 BLOOM 模型推理
LLM七种推理服务框架总结
LLM投机采样（Speculative Sampling）为何能加速模型推理
大模型推理妙招—投机采样（Speculative Decoding）
https://github.com/flexflow/FlexFlow/tree/inference
TensorRT-LLM(3)--架构
NLP（十八）：LLM 的推理优化技术纵览：https://zhuanlan.zhihu.com/p/642412124
揭秘NVIDIA大模型推理框架：TensorRT-LLM：https://zhuanlan.zhihu.com/p/680808866
如何生成文本: 通过 Transformers 用不同的解码方法生成文本 | How to generate text: using different decoding methods for language generation with Transformers

大模型推理优化技术

KV Cache：

解码优化：

大模型推理妙招—投机采样（Speculative Decoding）

vLLM

LLM数据工程

An Initial Exploration of Theoretical Support for Language Model Data Engineering. Part 1: Pretraining @ 符尧

LLM压缩

LLM测评

CLiB中文大模型能力评测榜单
huggingface Open LLM Leaderboard
HELM：https://github.com/stanford-crfm/helm
HELM：https://crfm.stanford.edu/helm/latest/
lm-evaluation-harness：https://github.com/EleutherAI/lm-evaluation-harness/
CLEVA：http://www.lavicleva.com/#/homepage/overview
CLEVA：https://github.com/LaVi-Lab/CLEVA/blob/main/README_zh-CN.md

提示工程

综合

safetensors：

AI框架

PyTorch

PyTorch 源码解读系列 @ OpenMMLab 团队
[源码解析] PyTorch 分布式 @ 罗西的思考
PyTorch 分布式(18) --- 使用 RPC 的分布式流水线并行 @ 罗西的思考
【Pytorch】model.train() 和 model.eval() 原理与用法

DeepSpeed

Megatron-LM

Megatron-DeepSpeed

Huggingface Transformers

AI基础知识

AI基础设施

AI芯片

业界AI加速芯片浅析（一）百度昆仑芯
NVIDIA CUDA-X AI：https://www.nvidia.cn/technologies/cuda-x/
Intel，Nvidia，AMD三大巨头火拼GPU与CPU
处理器与AI芯片-Google-TPU：https://zhuanlan.zhihu.com/p/646793355
一文看懂国产AI芯片玩家
深度 | 国产AI芯片，玩家几何

CUDA

AI编译器

TVM资料
AI编译器原理 @ZIMO酱

LLM应用开发

LLMOps

MLOps Landscape in 2023: Top Tools and Platforms
What Constitutes A Large Language Model Application? ：LLM Functionality Landscape
AI System @吃果冻不吐果冻皮

RAG

https://github.com/hymie122/RAG-Survey

书籍

大语言模型原理与工程 @杨青
大语言模型从理论到实践 @张奇：https://intro-llm.github.io/
动手学大模型

LLM实践

minGPT @karpathy
llm.c @karpathy: LLM training in simple, raw C/CUDA
LLM101n
llama2.c: Inference Llama 2 in one file of pure C
nanoGPT
Baby-Llama2-Chinese
从0到1构建一个MiniLLM
gpt-fast 、blog

大模型汇总资料

微信公众号文章集锦

其他

Hugging Face 博客