Home

Awesome

[EMNLP'20 (Long)] SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning

A PyTorch implementation of SSCR

Paper | Slide

<img src='imgs/result.png' width='70%' />

Overview

SSCR is an implementation of <br> "SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning" <br> Tsu-Jui Fu, Xin Eric Wang, Scott Grafton, Miguel Eckstein, and William Yang Wang <br> in Conference on Empirical Methods in Natural Language Processing (EMNLP) 2020 (Long)

<img src='imgs/overview.png' width='80%' />

The iterative editor is a conditional generator that modifies an image based on current instruction and editing history. Counterfactual reasoning allows the model to think about the expected, resulting images under various counterfactual instructions. In this way, the editor can consider more diverse instructions than the original data to improve the generalizability, even if under data scarcity. With the help of this cross-task matching, we can not only provide a detailed token-level training signal to train the editor better but also supply training loss for counterfactual reasoning in a selfsupervised scenario.

Requirements

This code is implemented under Python3 and PyTorch. <br> Following libraries are also required:

Usage

visdom
python -m geneva.inference.train @example_args/iclevr-d-subtract.args
python -m geneva.inference.test @example_args/iclevr-d-subtract.args

Resources

Citation

@inproceedings{fu2020sscr, 
  author = {Tsu-Jui Fu and Xin Eric Wang and Scott Grafton and Miguel Eckstein and William Yang Wang}, 
  title = {{SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning}}, 
  booktitle = {Conference on Empirical Methods in Natural Language Processing (EMNLP)}, 
  year = {2020} 
}