AxCell: Automatic Extraction of Results from Machine Learning Papers

This repository is the official implementation of AxCell: Automatic Extraction of Results from Machine Learning Papers.

[Figure: AxCell pipeline]

Requirements

To create a conda environment named axcell and install the requirements, run:

conda env create -f environment.yml

AxCell also requires Docker, set up so that it can be run without sudo. Run scripts/pull_docker_images.sh to download the necessary images.
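Before pulling the images, it can help to confirm that Docker is reachable by the current user without sudo. The following is a minimal sketch and not part of AxCell: the function name `docker_runs_without_sudo` is hypothetical, and it assumes that a successful `docker info` call is a good enough signal that the daemon is accessible.

```python
import shutil
import subprocess


def docker_runs_without_sudo(timeout: float = 30.0) -> bool:
    """Return True if `docker info` succeeds for the current user.

    Hypothetical helper, not part of AxCell. It only checks that the
    docker CLI is on PATH and that the daemon answers without sudo.
    """
    # The docker executable must be discoverable on PATH.
    if shutil.which("docker") is None:
        return False
    try:
        # `docker info` exits with a non-zero status when the daemon is
        # unreachable or the user lacks permission to access it.
        result = subprocess.run(
            ["docker", "info"],
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout,
            check=False,
        )
    except (OSError, subprocess.TimeoutExpired):
        return False
    return result.returncode == 0


if __name__ == "__main__":
    if docker_runs_without_sudo():
        print("Docker is usable without sudo.")
    else:
        print("Docker is missing or not usable without sudo.")
```

If the check fails on Linux, the usual cause is that the current user is not allowed to access the Docker daemon socket.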

Datasets

We publish the following datasets:

See the datasets notebook for an example of how to load the published datasets. The extraction notebook shows how to use axcell to extract text and tables from papers.

Evaluation

See the evaluation notebook for a full example of how to evaluate AxCell on the PWCLeaderboards dataset.

Training

Pre-trained Models

You can download pretrained models here:

Results

AxCell achieves the following performance:

| Dataset          | Macro F1 | Micro F1 |
|------------------|----------|----------|
| PWC Leaderboards | 21.1     | 28.7     |
| NLP-TDMS         | 19.7     | 25.8     |
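For readers unfamiliar with the two averages, the sketch below illustrates the generic difference between macro and micro F1. It is not AxCell's evaluation code, and the exact grouping used in the evaluation notebook may differ. The counts are made-up toy values, and the names `f1_from_counts`, `macro_f1` and `micro_f1` are hypothetical.

```python
from typing import List, Tuple

# Each group is (true positives, false positives, false negatives).
Counts = Tuple[int, int, int]


def f1_from_counts(tp: int, fp: int, fn: int) -> float:
    """F1 = 2*TP / (2*TP + FP + FN), defined as 0.0 when the denominator is 0."""
    denominator = 2 * tp + fp + fn
    return 2 * tp / denominator if denominator else 0.0


def macro_f1(groups: List[Counts]) -> float:
    """Compute F1 per group, then average: every group weighs the same."""
    if not groups:
        return 0.0
    return sum(f1_from_counts(*g) for g in groups) / len(groups)


def micro_f1(groups: List[Counts]) -> float:
    """Pool the counts over all groups, then compute a single F1."""
    tp = sum(g[0] for g in groups)
    fp = sum(g[1] for g in groups)
    fn = sum(g[2] for g in groups)
    return f1_from_counts(tp, fp, fn)


# Toy counts for illustration only, not taken from any AxCell result.
toy_groups: List[Counts] = [(8, 2, 2), (1, 3, 1)]

if __name__ == "__main__":
    print(f"macro F1: {macro_f1(toy_groups):.4f}")
    print(f"micro F1: {micro_f1(toy_groups):.4f}")
```

In this toy case the small, poorly predicted group pulls the macro average down more than the micro average, because macro F1 weights every group equally while micro F1 weights every prediction equally.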

License

AxCell is released under the Apache 2.0 license.

Citation

The pipeline is described in the following paper:

@inproceedings{axcell,
    title={AxCell: Automatic Extraction of Results from Machine Learning Papers},
    author={Marcin Kardas and Piotr Czapla and Pontus Stenetorp and Sebastian Ruder and Sebastian Riedel and Ross Taylor and Robert Stojnic},
    year={2020},
    booktitle={2004.14356}
}