Awesome

Awesome-LM-SSP

<img src="figure/title_new.png" alt="Awesome-LM-SSP" width="1000" height="auto" class="center">

Introduction

The resources related to the trustworthiness of large models (LMs) across multiple dimensions (e.g., safety, security, and privacy), with a special focus on multi-modal LMs (e.g., vision-language models and diffusion models).

This repo is in progress :seedling: (currently manually collected).
Badges:
- Model:
- Comment: ...
- Venue: ...
:sunflower: Welcome to recommend resources to us via <a href="https://github.com/ThuCCSLab/Awesome-LM-SSP/issues"> <img src="https://icons.iconarchive.com/icons/github/octicons/128/issue-opened-16-icon.png" width="15" height="15"></a> Issues with the following format (please fill in this table):

Title	Link	Code	Venue	Classification	Model	Comment
aa	arxiv	github	bb'23	A1. Jailbreak	LLM	Agent

News

[2024.08.17] We collected 34 related papers from ACL'24!
[2024.05.13] We collected 7 related papers from S&P'24!
[2024.04.27] We adjusted the categories.
[2024.01.20] We collected 3 related papers from NDSS'24!
[2024.01.17] We collected 108 related papers from ICLR'24!
[2024.01.09] 🚀 LM-SSP is released!

Collections

Book (2)
Competition (5)
Leaderboard (3)
Toolkit (10)
Survey (32)
Paper (1283)
- A. Safety (708)
  - A0. General (17)
  - A1. Jailbreak (283)
  - A2. Alignment (74)
  - A3. Deepfake (57)
  - A4. Ethics (5)
  - A5. Fairness (54)
  - A6. Hallucination (109)
  - A7. Prompt Injection (42)
  - A8. Toxicity (67)
- B. Security (197)
  - B0. General (7)
  - B1. Adversarial Examples (83)
  - B2. Poison & Backdoor (94)
  - B3. System (13)
- C. Privacy (378)
  - C0. General (28)
  - C1. Contamination (13)
  - C2. Copyright (131)
  - C3. Data Reconstruction (44)
  - C4. Membership Inference Attacks (34)
  - C5. Model Extraction (10)
  - C6. Privacy-Preserving Computation (70)
  - C7. Property Inference Attacks (3)
  - C8. Unlearning (45)

Star History

Acknowledgement

Organizers: Tianshuo Cong (丛天硕), Xinlei He (何新磊), Zhengyu Zhao (赵正宇), Yugeng Liu (刘禹更), Delong Ran (冉德龙)
This project is inspired by LLM Security, Awesome LLM Security, LLM Security & Privacy, UR2-LLMs, PLMpapers, EvaluationPapers4ChatGPT

<p align="center"><img src="figure/logo.png" width="900" /></p>