CodeIE
This is the official repository for "CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors" (ACL 2023).
If you have any questions or suggestions, please feel free to open an issue or email Peng Li (lip21 [at] m.fudan.edu.cn).
Introduction
Our codebase is adapted mainly from the UIE and CoCoGen repositories. We obtained and processed the original NER and RE datasets following UIE's workflow, implemented the inference logic of CodeIE with reference to CoCoGen's workflow, and evaluated the model outputs using UIE's evaluation process.
We sincerely thank the authors of these two works and of other prior work.
Notice
- A piece of not-so-good news is that the Codex models are now deprecated by OpenAI, which will have a significant impact on replicating this paper. Some possible solutions include applying for OpenAI's Researcher Access Program or accessing Codex on Azure OpenAI Service.
- Since we are using the closed-source APIs of OpenAI, we are unaware of the technical details behind them, such as the specific pre-training corpora used. Therefore, there might be potential issues of data contamination in the evaluation of our paper. (Acknowledgement: Oscar Sainz)
Workflow
Please refer to the workflow.md file.
Citation
If the paper or this repository helps you, please cite our paper:
@inproceedings{DBLP:conf/acl/LiSTYWHQ23,
author = {Peng Li and
Tianxiang Sun and
Qiong Tang and
Hang Yan and
Yuanbin Wu and
Xuanjing Huang and
Xipeng Qiu},
editor = {Anna Rogers and
Jordan L. Boyd{-}Graber and
Naoaki Okazaki},
title = {CodeIE: Large Code Generation Models are Better Few-Shot Information
Extractors},
booktitle = {Proceedings of the 61st Annual Meeting of the Association for Computational
Linguistics (Volume 1: Long Papers), {ACL} 2023, Toronto, Canada,
July 9-14, 2023},
pages = {15339--15353},
publisher = {Association for Computational Linguistics},
year = {2023},
url = {https://doi.org/10.18653/v1/2023.acl-long.855},
doi = {10.18653/V1/2023.ACL-LONG.855},
timestamp = {Fri, 16 Feb 2024 08:27:36 +0100},
biburl = {https://dblp.org/rec/conf/acl/LiSTYWHQ23.bib},
bibsource = {dblp computer science bibliography, https://dblp.org}
}