Home

Awesome

Learning Data Manipulation

This repo contains preliminary code of the following paper:

Learning Data Manipulation for Augmentation and Weighting
Zhiting Hu*, Bowen Tan*, Ruslan Salakhutdinov, Tom Mitchell, Eric P. Xing
NeurIPS 2019 (equal contribution)

Requirements

Code

Running

Running scripts for experiments are available in scripts/.

Results

All the detailed training logs are availble in results/.

(Note: The result numbers may be slightly different from those in the paper due to slightly different implementation details and random seeds, while the improvements over comparison methods are consistent.)

low data

SST-5
Base Model: BERTRen et al.WeightingAugmentation
33.32 ± 4.0436.09 ± 2.2636.51 ± 2.5437.55 ± 2.63
CIFAR-10
PretrainedNot Pretrained
Base Model: ResNet34.58 ± 4.1324.68 ± 3.29
Ren et al.23.29 ± 5.9522.26 ± 2.80
Weighting36.75 ± 3.0926.47 ± 1.69

imbalanced data

SST-2
20 : 100050 : 1000 100 : 1000
Base Model: BERT54.91 ± 5.9867.73 ± 9.2075.04 ± 4.51
Ren et al.74.61 ± 3.5476.89 ± 5.0780.73 ± 2.19
Weighting75.08 ± 4.9879.35 ± 2.5981.82 ± 1.88
CIFAR-10
20 : 100050 : 1000100 : 1000
Base Model: ResNet70.65 ± 4.9879.52 ± 4.8186.12 ± 3.37
Ren et al.76.68 ± 5.3577.34 ± 7.3878.57 ± 5.61
Weighting79.07 ± 5.0282.65 ± 5.1387.63 ± 3.72