Awesome
Jailbreak Large Vision-Language Models Through Multi-Modal Linkage
Code for the paper Jailbreak Large Vision-Language Models Through Multi-Modal Linkage
Data
We uploaded the encrypted images here
Run before Download it as ./dataset
Attack Commands
python attack.py --dataset 'safebench' \
--data-path 'dataset' \
--save-dir 'save_dir' \
--image-format 'images_wr'\
You can choose the encryption or attack methods by replacing the image-format
parameter. Here are some options:
-
images_figstep
: FigStep attack. -
images_qr
: QueryRelated attack. -
images_wr
: MML with word replacment -
images_miror
: MML with image mirroring. -
images_rotate
: MML with image rotation. -
images_base64
: MML with Base64-Encoding.
Main Results
Reference
If you find the code useful for your research, please consider citing
@article{wang2024jailbreak,
title={Jailbreak Large Vision-Language Models Through Multi-Modal Linkage},
author={Wang, Yu and Zhou, Xiaofei and Wang, Yichen and Zhang, Geyuan and He, Tianxing},
journal={arXiv preprint arXiv:2412.00473},
year={2024}
}