

The Journey, Not the Destination: How Data Guides Diffusion Models


Check out our blog post!

In our paper, we introduce a data attribution framework for diffusion models, together with an efficient method for computing attribution scores. Given a generated image X and a diffusion model of interest, you can use our library to identify the training examples that strongly guide the diffusion model towards generating X.

In particular, we provide attribution scores for each step of the diffusion process:

<div style="width:65%; margin: 0 auto;"> <table style="width:100%;"> <tr> <td><img src="assets/mscoco.gif" alt="MSCOCO Attributions GIF"></td> <td><img src="assets/cifar.gif" alt="CIFAR10 Attributions GIF"></td> </tr> </table> </div>
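
To illustrate what per-step scores make possible, here is a minimal sketch in plain Python. It is not this library's API: it assumes the attribution scores are already available as a nested list indexed as `[diffusion step][training example]`, and `top_k_per_step` and `toy_scores` are hypothetical names with made-up numbers.

```python
from typing import List, Sequence


def top_k_per_step(scores: Sequence[Sequence[float]], k: int) -> List[List[int]]:
    """For each diffusion step, return the indices of the k training
    examples with the highest attribution scores, best first."""
    top = []
    for step_scores in scores:
        # Rank training-example indices by their score at this step, descending.
        order = sorted(
            range(len(step_scores)),
            key=lambda i: step_scores[i],
            reverse=True,
        )
        top.append(order[:k])
    return top


# Toy scores (assumed layout): 3 diffusion steps x 4 training examples.
toy_scores = [
    [0.1, 0.9, -0.3, 0.2],
    [0.5, 0.4, 0.0, 0.8],
    [-0.2, 0.1, 0.7, 0.3],
]

top2 = top_k_per_step(toy_scores, k=2)
print(top2)  # [[1, 3], [3, 0], [2, 3]]
```

In this toy data the top-ranked training examples differ from step to step, which is the kind of pattern that per-step attribution is meant to surface.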


Check out the examples.

Our code is based on the TRAK API.


If you use this code in your work, please cite using the following BibTeX entry:

      title={The Journey, Not the Destination: How Data Guides Diffusion Models}, 
      author={Kristian Georgiev and Joshua Vendrow and Hadi Salman and Sung Min Park and Aleksander Madry},
      booktitle = {Arxiv preprint arXiv:2312.06205},