Awesome

Help me write a poem: Instruction Tuning as a Vehicle for Collaborative Poetry Writing

Repo associated with the EMNLP 2022 paper (https://arxiv.org/pdf/2210.13669.pdf)

Code

Scripts to train the model with both deepspeed and model parallelism are in training_scripts with an associated README.
Once the models were trained we run automatic evaluation, comparing to pretrained models (T0 3B/ T0pp) and InstructGPT (via the OpenAI API), as directed by the README in eval_scripts/. We report success rates of models satisfying instructions, averaged over 5 runs.
Subsequently, we also perform human evaluation comparing outputs from our finetuned T5-11B model and InstructGPT as seen in human_eval/.
Finally we run a user study using the interface in demo_poetic/. Logs of all user interactions are provided in human_eval/poems_submissions

Data

Drive Link to the training and validation data in the instruction format along with the test sets used in the paper.
Model generations used in the paper for automatic evaluation.
Poems and user interactions in the collaborative user study we ran with CoPoet.
Model weights of our finetuned T5-11B model (TODO)

Citation

                              @article{chakrabarty2022help,
                                title={Help me write a poem: Instruction Tuning as a Vehicle for Collaborative Poetry Writing},
                                author={Chakrabarty, Tuhin and Padmakumar, Vishakh and He, He},
                                journal={arXiv preprint arXiv:2210.13669},
                                year={2022}
                              }