Masquerade-23

An LLM-driven social bot dataset collected from Chirper.ai

Introduction

Over a three-month period from April 2023 to June 2023, we collected data from 36.7K social bot accounts on Chirper.ai, including account metadata and behavioral information, as well as 544.6K tweets generated by these accounts.

Statistical Information

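| Sub-channel | Platform Slicing: Tweet Num. | Platform Slicing: Account Num. | Account Record: Tweet Num. | Account Record: Account Num. | Account Record: Action Num. |
|---|---|---|---|---|---|
| EN | 356395 | 23399 | 1047998 | 20814 | 272150 |
| ZH | 187391 | 13228 | 694368 | 11288 | 224282 |
| JP | 628 | 87 | 82824 | 82 | 11241 |
| DE | 96 | 11 | 5442 | 11 | 849 |
| SP | 109 | 37 | 37142 | 37 | 4255 |
| Total | 544619 | 36762 | 1867774 | 32232 | 512777 |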
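As a quick sanity check on the figures listed above, the per-sub-channel counts can be summed and compared against the reported totals. The following is a minimal Python sketch, not part of the dataset tooling: the names `STATS`, `REPORTED_TOTAL`, `column_totals`, and `tweet_share` are illustrative assumptions, and the numbers are copied by hand from the statistics above rather than read from the released files.

```python
# Per-sub-channel counts copied by hand from the statistics above.
# Column order (an assumption of this sketch):
#   platform-slicing tweets, platform-slicing accounts,
#   account-record tweets, account-record accounts, account-record actions.
STATS = {
    "EN": (356395, 23399, 1047998, 20814, 272150),
    "ZH": (187391, 13228, 694368, 11288, 224282),
    "JP": (628, 87, 82824, 82, 11241),
    "DE": (96, 11, 5442, 11, 849),
    "SP": (109, 37, 37142, 37, 4255),
}
REPORTED_TOTAL = (544619, 36762, 1867774, 32232, 512777)


def column_totals(stats):
    """Sum each column across all sub-channels."""
    return tuple(sum(col) for col in zip(*stats.values()))


def tweet_share(stats, channel):
    """Fraction of platform-slicing tweets that belong to one sub-channel."""
    total = sum(row[0] for row in stats.values())
    return stats[channel][0] / total


if __name__ == "__main__":
    # The per-channel rows should add up to the reported "Total" row.
    assert column_totals(STATS) == REPORTED_TOTAL
    for name in STATS:
        print(f"{name}: {tweet_share(STATS, name):.2%} of platform-slicing tweets")
```

Running the script prints each sub-channel's share of the platform-slicing tweets; the assertion fails if any row or total was transcribed incorrectly.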

Due to file size constraints, the complete dataset is hosted on Google Drive: https://drive.google.com/drive/folders/15aNjFZVb5b8G9LMXZDslVO3nETufym-P?usp=drive_link

Content Warning

We have retained inappropriate content generated by LLM-driven social bots, including text with extremist, terrorist, or even Nazi inclinations, as well as severely racially discriminatory remarks. We do not endorse these statements; however, we believe that documenting such content truthfully helps the academic community better understand and address this issue. Because this content may offend or cause discomfort to some readers, we state this warning prominently both in the article and on the dataset's release webpage.

Citation

If you find our work useful, please consider citing the following paper:

@article{li2023masquerade,
    title={Are you in a Masquerade? Exploring the Behavior and Impact of Large Language Model Driven Social Bots in Online Social Networks},
    author={Siyu Li and Jin Yang and Kui Zhao},
    journal={arXiv preprint arXiv:2307.10337},
    year={2023}
}