Home

Awesome

The Rise and Potential of Large Language Model Based Agents: A Survey

🔥 Must-read papers for LLM-based agents.

🏃 Coming soon: Add one-sentence intro to each paper.

🔔 News

<div align=center><img src="./assets/figure1.jpg" width="80%" /></div>

🌟 Introduction

For a long time, humanity has pursued artificial intelligence (AI) equivalent to or surpassing human level, with AI agents considered as a promising vehicle of this pursuit. AI agents are artificial entities that sense their environment, make decisions, and take actions.

Due to the versatile and remarkable capabilities they demonstrate, large language models (LLMs) are regarded as potential sparks for Artificial General Intelligence (AGI), offering hope for building general AI agents. Many research efforts have leveraged LLMs as the foundation to build AI agents and have achieved significant progress.

In this repository, we provide a systematic and comprehensive survey on LLM-based agents, and list some must-read papers.

Specifically, we start by the general conceptual framework for LLM-based agents: comprising three main components: brain, perception, and action, and the framework can be tailored to suit different applications. Subsequently, we explore the extensive applications of LLM-based agents in three aspects: single-agent scenarios, multi-agent scenarios, and human-agent cooperation. Following this, we delve into agent societies, exploring the behavior and personality of LLM-based agents, the social phenomena that emerge when they form societies, and the insights they offer for human society. Finally, we discuss a range of key topics and open problems within the field.

We greatly appreciate any contributions via PRs, issues, emails, or other methods.

Table of Content (ToC)

1. The Birth of An Agent: Construction of LLM-based Agents

<div align=center><img src="./assets/figure2.jpg" width="80%" /></div>

1.1 Brain: Primarily Composed of An LLM

1.1.1 Natural Language Interaction

High-quality generation
Deep understanding

1.1.2 Knowledge

Pretrain model
Linguistic knowledge
Commonsense knowledge
Actionable knowledge
Potential issues of knowledge

1.1.3 Memory

Memory capability
Raising the length limit of Transformers
Summarizing memory
Compressing memories with vectors or data structures
Memory retrieval

1.1.4 Reasoning & Planning

Reasoning
Planning
Plan formulation
Plan reflection

1.1.5 Transferability and Generalization

Unseen task generalization
In-context learning
Continual learning

1.2 Perception: Multimodal Inputs for LLM-based Agents

1.2.1 Visual

1.2.2 Audio

1.3 Action: Expand Action Space of LLM-based Agents

1.3.1 Tool Using

1.3.2 Embodied Action

2. Agents in Practice: Applications of LLM-based Agents

<div align=center><img src="./assets/figure7.jpg" width="60%" /></div>

2.1 General Ability of Single Agent

<div align=center><img src="./assets/figure8.jpg" width="60%" /></div>

2.1.1 Task-oriented Deployment

In web scenarios

In life scenarios

2.1.2 Innovation-oriented Deployment

2.1.3 Lifecycle-oriented Deployment

2.2 Coordinating Potential of Multiple Agents

<div align=center><img src="./assets/figure9.jpg" width="60%" /></div>

2.2.1 Cooperative Interaction for Complementarity

Disordered cooperation

Ordered cooperation

2.2.2 Adversarial Interaction for Advancement

2.3 Interactive Engagement between Human and Agent

<div align=center><img src="./assets/figure10.jpg" width="60%" /></div>

2.3.1 Instructor-Executor Paradigm

Education
Health
Other Application

2.3.2 Equal Partnership Paradigm

Empathetic Communicator
Human-Level Participant

3. Agent Society: From Individuality to Sociality

<div align=center><img src="./assets/figure12.jpg" width="60%" /></div>

3.1 Behavior and Personality of LLM-based Agents

3.1.1 Social Behavior

Individual behaviors
Group behaviors

3.1.2 Personality

Cognition
Emotion
Character

3.2 Environment for Agent Society

3.2.1 Text-based Environment

3.2.2 Virtual Sandbox Environment

3.2.3 Physical Environment

3.3 Society Simulation with LLM-based Agents

4. Other Topics

4.1 Benchmarks for LLM-based Agents

4.2 Training and Optimizing LLM-based Agents

Citation

If you find this repository useful, please cite our paper:

@misc{xi2023rise,
      title={The Rise and Potential of Large Language Model Based Agents: A Survey}, 
      author={Zhiheng Xi and Wenxiang Chen and Xin Guo and Wei He and Yiwen Ding and Boyang Hong and Ming Zhang and Junzhe Wang and Senjie Jin and Enyu Zhou and Rui Zheng and Xiaoran Fan and Xiao Wang and Limao Xiong and Yuhao Zhou and Weiran Wang and Changhao Jiang and Yicheng Zou and Xiangyang Liu and Zhangyue Yin and Shihan Dou and Rongxiang Weng and Wensen Cheng and Qi Zhang and Wenjuan Qin and Yongyan Zheng and Xipeng Qiu and Xuanjing Huang and Tao Gui},
      year={2023},
      eprint={2309.07864},
      archivePrefix={arXiv},
      primaryClass={cs.AI}
}

Project Maintainers & Contributors

Contact

Star History

Star History Chart