Awesome
DiffMask
Overview
This library contains a Pytorch implementation of Differentiable Masking Explainer (Diffmask), as presented in [1](https://arxiv.org/abs/2004.14992).
Dependencies
- python>=3.6
- pytorch>=1.5: https://pytorch.org
- pytorch-lightning==0.7.3: https://pytorch-lightning.readthedocs.io
- transformers>=2.9.0: https://github.com/huggingface/transformers
- torch-optimizer>=0.0.1a9: https://github.com/jettify/pytorch-optimizer
- matplotlib>=3.1.1: https://matplotlib.org
Notice that older or newer version could work but they were not tested.
Installation
To install, run
$ python setup.py install
To donwload datasets run
$ ./scripts/download_datasets.sh
To download models, use the following link. Note that these are not the exact same models used for the paper.
Structure
- diffmask: Contains the source code for DiffMask.
We have 5 jupyter notebbok with the code for reproducing some of the results from our work [1]. Note that since i) the code was refactored and ii) we were not able to realise the exact same models used for the paper, re-generated plots and tables might differ from the ones in our work.
- QuestionAnsweringSquadAnalysis Plots and analysis using other methods of a BERT large model for question answering.
- QuestionAnsweringSquadDiffMaskAnalysis Plots and analysis using DiffMask of a BERT large model for question answering.
- SentimentClassificationSSTAnalysis Plots and analysis using other methods of a BERT base model for sentiment analisys.
- SentimentClassificationSSTDiffMaskAnalysis Plots and analysis using DiffMask of a BERT base model for sentiment analisys.
- ToyTask Training, analysis and plots for the artificial toy task using both other methods and DiffMask.
Please cite [1] in your work when using this library in your experiments.
Feedback
For questions and comments, feel free to contact Nicola De Cao.
License
MIT
Citation
[1] Nicola De Cao, Michael Sejr Schlichtkrull, Wilker Aziz, and Ivan Titov. 2020.
How do Decisions Emerge across Layers in Neural Models? Interpretation with Differentiable Masking.
In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP),
pages 3243–3255, Online. Association for Computational Linguistics.
BibTeX format:
@inproceedings{de-cao-etal-2020-decisions,
title = "How do Decisions Emerge across Layers in Neural Models? Interpretation with Differentiable Masking",
author = "De Cao, Nicola and
Schlichtkrull, Michael Sejr and
Aziz, Wilker and
Titov, Ivan",
booktitle = "Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)",
month = nov,
year = "2020",
address = "Online",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2020.emnlp-main.262",
doi = "10.18653/v1/2020.emnlp-main.262",
pages = "3243--3255",
}