Awesome-LLM4Tool ⚙️
Awesome-LLM4Tool is a curated list of papers, repositories, tutorials, and other resources on tool use with large language models.
Papers
TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents
</br>
Jingqing Ruan, Yihong Chen, Bin Zhang, Zhiwei Xu, Tianpeng Bao, Guoqing Du, Shiwei Shi, Hangyu Mao, Xingyu Zeng, Rui Zhao
</br>
[arXiv]
</br>
Aug 7 2023
</br>
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
</br>
Cheng-Yu Hsieh, Si-An Chen, Chun-Liang Li, Yasuhisa Fujii, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister
</br>
[arXiv]
</br>
Aug 1 2023
</br>
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
</br>
Yujia Qin, Shihao Liang, Yining Ye, Kunlun Zhu, Lan Yan, Yaxi Lu, Yankai Lin, Xin Cong, Xiangru Tang, Bill Qian, Sihan Zhao, Runchu Tian, Ruobing Xie, Jie Zhou, Mark Gerstein, Dahai Li, Zhiyuan Liu, Maosong Sun
</br>
[arXiv] [Code]
</br>
July 31 2023
</br>
ToolQA: A Dataset for LLM Question Answering with External Tools
</br>
Yuchen Zhuang, Yue Yu, Kuan Wang, Haotian Sun, Chao Zhang
</br>
[arXiv] [Code]
</br>
June 23 2023
</br>
On the Tool Manipulation Capability of Open-source Large Language Models
</br>
Qiantong Xu, Fenglu Hong, Bo Li, Changran Hu, Zhengyu Chen, Jian Zhang
</br>
[arXiv] [Code]
</br>
May 25 2023
</br>
Making Language Models Better Tool Learners with Execution Feedback
</br>
Shuofei Qiao, Honghao Gui, Huajun Chen, Ningyu Zhang
</br>
[arXiv] [Code]
</br>
May 22 2023
</br>
🦍 Gorilla: Large Language Model Connected with Massive APIs
</br>
Shishir G. Patil*, Tianjun Zhang*, Xin Wang, Joseph E. Gonzalez
</br>
[arXiv] [Project Page] [Code]
</br>
May 24 2023
</br>
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings
</br>
Shibo Hao, Tianyang Liu, Zhen Wang, Zhiting Hu
</br>
[arXiv] [Code]
</br>
May 19 2023
</br>
InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
</br>
Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Zeqiang Lai, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, Limin Wang, Ping Luo, Jifeng Dai, Yu Qiao
</br>
[arXiv] [Project Page] [Code]
</br>
May 9 2023
</br>
GPT4Tools: Teaching LLM to Use Tools via Self-instruction
</br>
Lin Song, Yanwei Li, Rui Yang, Sijie Zhao, Yixiao Ge, Ying Shan
</br>
[arXiv] [Huggingface] [Code]
</br>
April 24 2023
</br>
🌋 LLaVA: Large Language and Vision Assistant
</br>
Haotian Liu*, Chunyuan Li*, Qingyang Wu, Yong Jae Lee
</br>
[arXiv] [Project Page] [Code]
</br>
April 18 2023
</br>
Tool Learning with Foundation Models
</br>
Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang, Jing Yi, Yuzhang Zhu, Zhenning Dai, Lan Yan, Xin Cong, Yaxi Lu, Weilin Zhao, Yuxiang Huang, Junxi Yan, Xu Han, Xian Sun, Dahai Li, Jason Phang, Cheng Yang, Tongshuang Wu, Heng Ji, Zhiyuan Liu, Maosong Sun
</br>
[arXiv] [Code]
</br>
April 17 2023
</br>
API-Bank: A Benchmark for Tool-Augmented LLMs
</br>
Minghao Li, Feifan Song, Bowen Yu, Haiyang Yu, Zhoujun Li, Fei Huang, Yongbin Li
</br>
[arXiv]
</br>
April 14 2023
</br>
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace
</br>
Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu, Yueting Zhuang
</br>
[arXiv] [Huggingface] [Code]
</br>
Mar 30 2023
</br>
Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
</br>
Chenfei Wu, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, Nan Duan
</br>
[arXiv] [Huggingface] [Code]
</br>
Mar 8 2023
</br>
Toolformer: Language Models Can Teach Themselves to Use Tools
</br>
Timo Schick, Jane Dwivedi-Yu, Roberto Dessì, Roberta Raileanu, Maria Lomeli, Luke Zettlemoyer, Nicola Cancedda, Thomas Scialom
</br>
[arXiv] [Code]
</br>
Feb 9 2023
</br>
Visual Programming: Compositional Visual Reasoning without Training
</br>
Tanmay Gupta, Aniruddha Kembhavi
</br>
[arXiv] [Code]
</br>
Nov 18 2022
</br>
TALM: Tool Augmented Language Models
</br>
Aaron Parisi, Yao Zhao, Noah Fiedel
</br>
[arXiv]
</br>
May 24 2022
</br>
MRKL Systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning
</br>
Ehud Karpas, Omri Abend, Yonatan Belinkov, Barak Lenz, Opher Lieber, Nir Ratner, Yoav Shoham, Hofit Bata, Yoav Levine, Kevin Leyton-Brown, Dor Muhlgay, Noam Rozen, Erez Schwartz, Gal Shachaf, Shai Shalev-Shwartz, Amnon Shashua, Moshe Tenenholtz
</br>
[arXiv] [Code]
</br>
May 1 2022