Awesome

Deepfake for the Good

We would like depict steps to reproduce outcomes we achieved and described in paper "Deepfake for the Good: Generating Avatars through Face-Swapping with Implicit Deepfake Generation". In the aforementioned work we show a novel approach to get 3D deepfake representation of a person using a single 2D photo and a set of images of a base face avatar. Below there is an illustrative movie of such attempt's outcome for Ms. Céline Dion.

Result video:

https://github.com/quereste/implicit-deepfake/assets/77748206/0072ebb6-cf71-4f98-95f3-b58a51612815

Update: 4D ImplicitDeepfake

Now it's possible to use our solution to create 4D deepfake representation. As for 3D version we use single 2D photo and video of a base face avatar. Below we present capabilities of our solution.

Original video:

https://github.com/quereste/implicit-deepfake/assets/77748206/ec3dc76a-a215-4d2c-87e2-21617200a768

Face to swap with:

Output video:

https://github.com/quereste/implicit-deepfake/assets/77748206/62675841-220a-467e-8cd7-b1ce76944168

It's also possible to change facial expressions

Paper	After ImplicitDeepfake	Changed expression
<img src="assets/paper_0001.png" alt="paper_0001" width="300" height="300">	<img src="assets/original_0001.png" alt="original_0001" width="300" height="300">	<img src="assets/changed_0001.png" alt="changed_0001" width="300" height="300">
<img src="assets/paper_0002.png" alt="paper_0002" width="300" height="300">	<img src="assets/original_0002.png" alt="original_0002" width="300" height="300">	<img src="assets/changed_0002.png" alt="changed_0002" width="300" height="300">
<img src="assets/paper_0003.png" alt="paper_0003" width="300" height="300">	<img src="assets/original_0003.png" alt="original_0003" width="300" height="300">	<img src="assets/changed_0003.png" alt="changed_0003" width="300" height="300">
<img src="assets/paper_0004.png" alt="paper_0004" width="300" height="300">	<img src="assets/original_0004.png" alt="original_0004" width="300" height="300">	<img src="assets/changed_0004.png" alt="changed_0004" width="300" height="300">
<img src="assets/paper_0005.png" alt="paper_0005" width="300" height="300">	<img src="assets/original_0005.png" alt="original_0005" width="300" height="300">	<img src="assets/changed_0005.png" alt="changed_0005" width="300" height="300">

Steps to reproduce for 3D ImplicitDeepfake

Download dataset from link

This dataset consists of CelebA picture of Ms. Céline Dion (file: famous.jpg) and a directory with train, validation and test pictures of a base face avatar. With every subdirectory there is associated a .json file, containing camera positions, from which specific photos were taken. We hereby stress, that the face avatar we use in this example comes from this link. We are thankful for this piece of work.

Convert every photo from the dataset to a 2D deepfake

Use a 2D deepfake of your choice to convert all the pictures from the dataset directory to their deepfake versions, using file famous.jpg as a target photo. For the experiments we conducted in the paper, we used GHOST deepfake (see citations).

Pick a 3D rendering model to be rewarded with a 3D deepfake representation of the target person from step 2

Both NeRF and Gaussian Splatting solutions (see citations) work fine, our pipeline does not demand any specific model though. The result from the short illustrative video comes from Gaussian Splatting model.

Notebook for your convenience

We created a notebook that covers steps 2 and 3 from above, assuming the use of Gaussian Splatting as the 3D rendering technique. Its content is based on similar notebooks from the repos of the matter. In case of any doubts, feel free to ask us. The notebook needs no further requirements when being run on Google Colab.

Steps to reproduce for 4D ImplicitDeepfake

Download the dataset (as noted here).

This dataset consists of directory with train, validation and test pictures of a base face avatar. With every subdirectory there is associated a .json file, containing camera positions, from which specific photos were taken.

Download photo famous.jpg
Convert every photo from the dataset to a 2D deepfake

Use 4D Facial avatars to get 4D avatar facial reconstruction.

Attention this model requires at least 80GB RAM!

Citation

Another 3D model we used was expertly created by Author here.

We would like to express our gratitude to the authors of Gaussian Splatting and NeRF model, along with the pytorch representation of the latter. We used Gaussian Splatting and NeRF to achieve 3D rendering results.

@misc{mildenhall2020nerf,
    title={NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis},
    author={Ben Mildenhall and Pratul P. Srinivasan and Matthew Tancik and Jonathan T. Barron and Ravi Ramamoorthi and Ren Ng},
    year={2020},
    eprint={2003.08934},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

@misc{lin2020nerfpytorch,
  title={NeRF-pytorch},
  author={Yen-Chen, Lin},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished={\url{https://github.com/yenchenlin/nerf-pytorch/}},
  year={2020}
}

@Article{kerbl3Dgaussians,
      author       = {Kerbl, Bernhard and Kopanas, Georgios and Leimk{\"u}hler, Thomas and Drettakis, George},
      title        = {3D Gaussian Splatting for Real-Time Radiance Field Rendering},
      journal      = {ACM Transactions on Graphics},
      number       = {4},
      volume       = {42},
      month        = {July},
      year         = {2023},
      url          = {https://repo-sam.inria.fr/fungraph/3d-gaussian-splatting/}
}

Big thanks to the authors of the 4D Facial Avatars model. We used it to obtain 4D rendering results.

@InProceedings{Gafni_2021_CVPR,
    author    = {Gafni, Guy and Thies, Justus and Zollh{\"o}fer, Michael and Nie{\ss}ner, Matthias},
    title     = {Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {8649-8658}
}

Last but not least, we hereby cite the 2D deepfake GHOST work that was used in our original pipeline.

@article{9851423,
         author={Groshev, Alexander and Maltseva, Anastasia and Chesakov, Daniil and Kuznetsov, Andrey and Dimitrov, Denis},
         journal={IEEE Access},
         title={GHOST—A New Face Swap Approach for Image and Video Domains},
         year={2022},
         volume={10},
         number={},
         pages={83452-83462},
         doi={10.1109/ACCESS.2022.3196668}
}