Awesome
Two Tales of Persona in LLMs: <br> A Survey of Role-Playing and Personalization
<br> <p align="center"> <img src="figures/Overview.png" alt="Overview" width="40%" height="auto"> </p>Introduction
This is the official repository of the paper "Two Tales of Persona in LLMs: A Survey of Role-Playing and Personalization", EMNLP 2024 Findings.
The concept of persona, originally adopted in dialogue literature, has re-surged as a promising framework for tailoring large language models (LLMs) to specific context (e.g., personalized search, LLM-as-a-judge). However, the growing research on leveraging persona in LLMs is relatively disorganized and lacks a systematic taxonomy. To close the gap, we present a comprehensive survey to categorize the current state of the field. We identify two lines of research, namely (1) LLM Role-Playing, where personas are assigned to LLMs, and (2) LLM Personalization, where LLMs take care of user personas. Additionally, we introduce existing methods for LLM personality evaluation. To the best of our knowledge, we present the first survey for role-playing and personalization in LLMs under the unified view of persona.
We continuously maintain this paper collection to foster future endeavors.
<div style="display: flex; align-items: center;"> <div style="flex: 1;">News
- [2024.10.05] :dart: We update the camera-ready version on arXiv. Click the link to check it out!
- [2024.09.20] :confetti_ball: Excited to share that our paper is accepted at EMNLP 2024 Findings! Hooray :raised_hands:!
- [2024.06.27] :fire: We update an 8-page version on arXiv.
- [2024.06.04] :rocket: Our paper is now available on arXiv and the reading list on GitHub.
Table of Contents
LLMs are tasked to play the assigned personas (i.e., roles) and act accordance to environmental feedback.
The key aspect is how LLMs adapt to defined environments.
<br> <p align="center"> <a href="."> <img src="figures/llm-role-playing.png" alt="LLM role-playing" width="90%" height="auto"> </a> </p> <h3 id="role-playing-workshops">๐ผ Workshops</h3>Date | Workshop | Website Link |
---|---|---|
2405 | LLMAgent @ ICLR | ICLR 2024 Workshop on Large Language Model (LLM) Agents |
2405 | Agent Workshop @ CMU | CMU Agent Workshop 2024 |
Date | Authors | Venue | Paper |
---|---|---|---|
2308 | Hong et al. | ICLR | MetaGPT: Meta Programming for Multi-Agent Collaborative Framework |
2307 | Qian et al. | arXiv | Communicative agents for software development |
2305 | Dong et al. | TOSEM | Self-collaboration code generation via chatgpt |
2107 | Chen et al. | arXiv | Evaluating large language models trained on code |
Date | Authors | Venue | Paper |
---|---|---|---|
2404 | Liu et al. | arXiv | VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding? |
2401 | Zheng et al. | LLMAgent @ ICLR | GPT-4V(ision) is a Generalist Web Agent, if Grounded |
2401 | Koh et al. | LLMAgent @ ICLR | VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks |
2401 | Cheng et al. | LLMAgent @ ICLR | SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents |
2312 | Gur et al. | EMNLP | Understanding HTML with Large Language Models |
2312 | Hong et al. | arXiv | CogAgent: A Visual Language Model for GUI Agents |
2307 | Gur et al. | ICLR | A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis |
2307 | Zhou et al. | ICLR | WebArena: A Realistic Web Environment for Building Autonomous Agents |
2306 | Deng et al. | NeurIPS | Mind2web: Towards a generalist agent for the web |
2303 | Kim et al. | NeurIPS | Language Models can Solve Computer Tasks |
Date | Authors | Venue | Paper |
---|---|---|---|
2310 | Wang et al. | EMNLP | Humanoid Agents: Platform for Simulating Human-like Generative Agents |
2305 | Wang et al. | TMLR | Voyager: An open-ended embodied agent with large language models |
2305 | Fu et al. | arXiv | Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback |
2304 | Park et al. | UIST | Generative agents: Interactive simulacra of human behavior |
Date | Authors | Venue | Paper |
---|---|---|---|
2312 | Kwon et al. | AAAI | Large Language Models are Clinical Reasoners: Reasoning-Aware Diagnosis Framework with Prompt-Generated Rationales |
2311 | Tang et al. | arXiv | Medagents: Large language models as collaborators for zero-shot medical reasoning |
2307 | Wu et al. | ICLR | Large Language Models Perform Diagnostic Reasoning |
2207 | Liรฉvin et al. | arXiv | Can large language models reason about medical questions? |
Date | Authors | Venue | Paper |
---|---|---|---|
2308 | Chan et al. | ICLR | ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate |
2303 | Wu et al. | NLPCC | Large Language Models are Diverse Role-Players for Summarization Evaluation |
Date | Authors | Venue | Paper |
---|---|---|---|
2405 | Ahn, et al | ACL Findings | TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models |
2308 | Chen et al. | ICLR | AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors |
2307 | Wang et al. | NAACL | Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration |
2303 | Li et al. | NeurIPS | CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society |
Date | Authors | Venue | Paper |
---|---|---|---|
2401 | Cheng et al. | LLMAgent @ ICLR | SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents |
2401 | Zheng et al. | LLMAgent @ ICLR | GPT-4V(ision) is a Generalist Web Agent, if Grounded |
2312 | Hong et al. | arXiv | CogAgent: A Visual Language Model for GUI Agents |
2305 | Wang et al. | TMLR | Voyager: An open-ended embodied agent with large language models |
Date | Authors | Venue | Paper |
---|---|---|---|
2311 | Tang et al. | arXiv | Medagents: Large language models as collaborators for zero-shot medical reasoning |
2308 | Chen et al. | ICLR | AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors |
2308 | Hong et al. | ICLR | MetaGPT: Meta Programming for Multi-Agent Collaborative Framework |
2308 | Chan et al. | ICLR | ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate |
2307 | Qian et al. | arXiv | Communicative agents for software development |
2305 | Fu et al. | arXiv | Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback |
Date | Authors | Venue | Paper |
---|---|---|---|
2311 | Tang et al. | arXiv | Medagents: Large language models as collaborators for zero-shot medical reasoning |
2308 | Chen et al. | ICLR | AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors |
2307 | Wang et al. | NAACL | Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration |
2305 | Fu et al. | arXiv | Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback |
2303 | Li et al. | NeurIPS | CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society |
LLMs are tasked to take care of usersโ personas (e.g., background information, or historical behaviors) to meet customized needs.
The key aspect is how LLMs adapt to distinct users.
<p align="center"> <a href="."> <img src="figures/llm-personalization.png" alt="LLM personalization" width="90%" height="auto"> </a> </p> <h3 id="personalization-workshops">๐ผ Workshops & Competitions</h3>Date | Authors | Venue | Paper |
---|---|---|---|
2403 | Deshpande et al. | PERSONALIZE @ EACL | Proceedings of the 1st Workshop on Personalization of Generative AI Systems (PERSONALIZE 2024) |
2310 | Chen et al. | Personalized Generative AI @ CIKM | The First Workshop on Personalized Generative AI @ CIKM 2023: Personalization Meets Large Language Models |
1902 | Dinan et al. | ConvAI2 @ NeurIPS | The Second Conversational Intelligence Challenge (ConvAI2) |
1808 | Yusupov et al. | ConvAI @ NeurIPS | NIPS Conversational Intelligence Challenge 2017 Winner System: Skill-based Conversational Agent with Supervised Dialog Manager |
LLMs Era
Date | Authors | Venue | Paper |
---|---|---|---|
2305 | Yang et al. | EMNLP | RefGPT: Dialogue Generation of GPT, by GPT, and for GPT |
2302 | Li et al. | NeurIPS | Guiding large language models via directional stimulus prompting |
2005 | Hosseini-Asl et al. | NeurIPS | A Simple Language Model for Task-Oriented Dialogue |
Pre-LLMs Era
Date | Authors | Venue | Paper |
---|---|---|---|
2006 | Jianhong Wang et al. | ICLR | Modelling hierarchical structure between dialogue policy and natural language generator with option framework for task-oriented dialogue system |
1606 | N. Mrksic et al. | ACL | Neural Belief Tracker: Data-Driven Dialogue State Tracking |
1506 | Alessandro Sordoni et al. | NAACL | A Neural Network Approach to Context-Sensitive Generation of Conversational Responses |
Date | Authors | Venue | Paper |
---|---|---|---|
2405 | Han | ACL | PSYDIAL: Personality-based Synthetic Dialogue Generation Using Large Language Models |
2307 | Tang et al. | ACL | Enhancing Personalized Dialogue Generation with Contrastive Latent Variables: Combining Sparse and Dense Persona |
1807 | Zhang et al. | ACL | Personalizing Dialogue Agents: I have a dog, do you have pets too? |
Date | Authors | Venue | Paper |
---|---|---|---|
2305 | Yang et al. | arXiv | PALR: Personalization Aware LLMs for Recommendation |
2304 | Wang et al. | arXiv | Zero-Shot Next-Item Recommendation using Large Pretrained Language Models |
2108 | Li et al. | ACL | Personalized Transformer for Explainable Recommendation |
Date | Authors | Venue | Paper |
---|---|---|---|
2405 | Zhou et al. | WWW | Cognitive personalized search integrating large language models with an efficient memory mechanism |
2405 | Baek et al. | WWW | Knowledge-Augmented Large Language Models for Personalized Contextual Query Suggestion |
2405 | Salemi | arXiv | Unified ranking for multiple retrieval-augmented large language models |
2402 | Sharma et al. | CHI | Generative echo chamber? effects of llm-powered search systems on diverse information seeking |
2307 | Eleni et al. | arXiv | Comparing Traditional and LLM-based Search for Consumer Choice: A Randomized Experiment |
2307 | Ziems et al. | ACL | Large Language Models are Built-in Autoregressive Search Engines |
2107 | Zhou et al. | SIGIR | Group based Personalized Search by Integrating Search Behaviour and Friend Network |
Date | Authors | Venue | Paper |
---|---|---|---|
2402 | Abbasian et al. | arXiv | Knowledge-Infused LLM-Powered Conversational Health Agent: A Case Study for Diabetes Patients |
2402 | Jin et al. | arXiv | Health-LLM: Personalized Retrieval-Augmented Disease Prediction System |
2310 | Abbasian et al. | arXiv | Conversational Health Agents: A Personalized LLM-Powered Agent Framework |
2309 | Zhang et al. | arXiv | LLM-based Medical Assistant Personalization with Short- and Long-Term Memory Coordination |
Date | Authors | Venue | Paper |
---|---|---|---|
2403 | Park et al. | CHI | Empowering personalized learning through a conversation-based tutoring system with student modeling |
2308 | Dan et al. | arXiv | Educhat: A large-scale language model-based chatbot system for intelligent education |
2307 | Shehata et al. | BEA @ ACL | Enhancing Video-based Learning Using Knowledge Tracing: Personalizing Studentsโ Learning Experience with ORBITS |
Date | Authors | Venue | Paper |
---|---|---|---|
2404 | Zhang et al. | arXiv | Personalized LLM Response Generation with Parameterized Memory Injection |
2403 | Zhong et al. | AAAI | MemoryBank: Enhancing Large Language Models with Long-Term Memory |
2402 | Sun et al. | arXiv | Persona-DB: Efficient Large Language Model Personalization for Response Prediction with Collaborative Data Refinement |
2402 | Tan et al. | arXiv | Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning |
2205 | Fu et al. | ACL | There Are a Thousand Hamlets in a Thousand Peopleโs Eyes: Enhancing Knowledge-grounded Dialogue with Personal Memory |
2106 | Wu et al. | NAACL | Personalized Response Generation via Generative Split Memory Network |
Date | Authors | Venue | Paper |
---|---|---|---|
2305 | Dai et al. | RecSys | Uncovering ChatGPTโs Capabilities in Recommender Systems |
2305 | Christakopoulou et al. | arXiv | Large Language Models for User Interest Journeys |
2305 | Zhiyuli et al. | arXiv | BookGPT: A General Framework for Book Recommendation Empowered by Large Language Model |
Date | Authors | Venue | Paper |
---|---|---|---|
2311 | Mysore et al. | arXiv | PEARL: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers |
2308 | Li et al. | arXiv | Teach LLMs to Personalize -- An Approach inspired by Writing Education |
2304 | Salemi et al. | arXiv | LaMP: When Large Language Models Meet Personalization |
Date | Authors | Venue | Paper |
---|---|---|---|
2405 | Li et al. | WWW | Learning to Rewrite Prompts for Personalized Text Generation |
2310 | Richardson et al. | arXiv | Integrating Summarization and Retrieval for Enhanced Personalization via Large Language Models |
2305 | Liu et al. | WSDM | ONCE: Boosting Content-based Recommendation with Both Open- and Closed-source Large Language Models |
Date | Authors | Venue | Paper |
---|---|---|---|
2401 | Huang et al. | ICLR | On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs |
2309 | Jiang et al. | NeurIPS | Evaluating and inducing personality in pre-trained language models |
2307 | Fang et al. | ACL | On Text-based Personality Computing: Challenges and Future Directions |
:sparkles: Welcome to contribute to this reading list via :memo: Issues using the following format.
Date | Authors | Venue | Paper |
---|---|---|---|
1706 | Vaswani, et al | NeurIPS | Attention Is All You Need |
๐ If you find our survey beneficial for your research, please kindly cite our paper :-)
@misc{tseng2024talespersonallmssurvey,
title={Two Tales of Persona in LLMs: A Survey of Role-Playing and Personalization},
author={Yu-Min Tseng and Yu-Chao Huang and Teng-Yun Hsiao and Wei-Lin Chen and Chao-Wei Huang and Yu Meng and Yun-Nung Chen},
year={2024},
eprint={2406.01171},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2406.01171},
}
<h2 id="authors">๐๏ธ Authors</h2>
Yu-Min Tseng*, Yu-Chao Huang*, Teng-Yun Hsiao*, Wei-Lin Chen*, Chao-Wei Huang, Yu Meng, Yun-Nung Chen.
(* Equal Contribution.) (Acknowlegement: Yu-Ching Hsu, Jia-Yin Foo.)
<div align="center"> <img src="figures/ntu-logo.png" width="20%" /> <img src="figures/uva-logo.png" width="25%" /> </div>