Awesome-LM-SSP
<img src="figure/title_new.png" alt="Awesome-LM-SSP" width="1000" height="auto" class="center">
Introduction
A collection of resources on the trustworthiness of large models (LMs) across multiple dimensions (e.g., safety, security, and privacy), with a special focus on multi-modal LMs (e.g., vision-language models and diffusion models).
- This repo is in progress :seedling: (currently manually collected).
- Badges:
  - Model:
  - Comment: ...
  - Venue: ...
:sunflower: You are welcome to recommend resources to us via <a href="https://github.com/ThuCCSLab/Awesome-LM-SSP/issues"> <img src="https://icons.iconarchive.com/icons/github/octicons/128/issue-opened-16-icon.png" width="15" height="15"></a> Issues using the following format (please fill in this table):
Title | Link | Code | Venue | Classification | Model | Comment |
---|---|---|---|---|---|---|
aa | arxiv | github | bb'23 | A1. Jailbreak | LLM | Agent |
News
- [2024.08.17] We collected 34 related papers from ACL'24!
- [2024.05.13] We collected 7 related papers from S&P'24!
- [2024.04.27] We adjusted the categories.
- [2024.01.20] We collected 3 related papers from NDSS'24!
- [2024.01.17] We collected 108 related papers from ICLR'24!
- [2024.01.09] 🚀 LM-SSP is released!
Collections
- Book (2)
- Competition (5)
- Leaderboard (3)
- Toolkit (10)
- Survey (32)
- Paper (1283)
  - A. Safety (708)
    - A0. General (17)
    - A1. Jailbreak (283)
    - A2. Alignment (74)
    - A3. Deepfake (57)
    - A4. Ethics (5)
    - A5. Fairness (54)
    - A6. Hallucination (109)
    - A7. Prompt Injection (42)
    - A8. Toxicity (67)
  - B. Security (197)
    - B0. General (7)
    - B1. Adversarial Examples (83)
    - B2. Poison & Backdoor (94)
    - B3. System (13)
  - C. Privacy (378)
    - C0. General (28)
    - C1. Contamination (13)
    - C2. Copyright (131)
    - C3. Data Reconstruction (44)
    - C4. Membership Inference Attacks (34)
    - C5. Model Extraction (10)
    - C6. Privacy-Preserving Computation (70)
    - C7. Property Inference Attacks (3)
    - C8. Unlearning (45)
Star History
Acknowledgement
- Organizers: Tianshuo Cong (丛天硕), Xinlei He (何新磊), Zhengyu Zhao (赵正宇), Yugeng Liu (刘禹更), Delong Ran (冉德龙)
- This project is inspired by LLM Security, Awesome LLM Security, LLM Security & Privacy, UR2-LLMs, PLMpapers, and EvaluationPapers4ChatGPT.