Home

Awesome

Awesome GUI Agent Paper List

This repo covers a variety of papers related to GUI Agents, such as:

keyword_wordcloud_long.png

Papers Grouped by Environments

WebMobileDesktopGUIMisc

(Misc: Papers for general topics that have important applications in GUI agents.)

Papers Grouped by Keywords

framework (107) | benchmark (68) | dataset (63) | model (31) | reinforcement learning (13) | safety (11) | visual grounding (8) | planning (7) | reasoning (6) | grounding (5) | vision language model (5) | survey (4) | attack (4) | learning (4) | evaluation (4) | synthetic data (3) | foundation model (3) | UI understanding (3) | self-improvement (3) | programming-by-demonstration (3)

Papers Grouped by Authors

Yu Su (9) | Graham Neubig (8) | Huan Sun (8) | Tianbao Xie (7) | Tao Yu (7) | Boyuan Zheng (7) | Shuyan Zhou (7) | Xiao Liu (6) | Hanyu Lai (6) | Jie Tang (6) | Yuxiao Dong (6) | Difei Gao (5) | Mike Zheng Shou (5) | Zhiyong Wu (5) | Daniel Fried (5) | Toby Jia-Jun Li (5) | Ruslan Salakhutdinov (4) | Caiming Xiong (4) | Boyu Gou (4) | Yu Gu (4)

All Papers (from most recent to oldest)

<details open> <summary>Papers</summary> </details>

How to Add a Paper or Update the README

Please fork and update:

πŸ€– You can use this GPTs to quickly search and get a formatted paper entry automatically by inputting a paper name. Or you can simply leave a comment in an issue.

<details> <summary>Format example and explanation</summary>
- [title](paper link)
    - List authors directly without a "key" identifier (e.g., author1, author2)
    - πŸ›οΈ Institutions: List the institutions concisely, using abbreviations (e.g., university names, like OSU).
    - πŸ“… Date: e.g., Oct 30, 2024
    - πŸ“‘ Publisher: ICLR 2025
    - πŸ’» Env: Indicate the research environment within brackets, such as [Web], [Mobile], or [Desktop]. Use [GUI] if the research spans multiple environments. Use [Misc] if it is researching in general domains.
    - πŸ”‘ Key: Label each keyword within brackets, e.g., [model], [framework],[dataset],[benchmark].
    - πŸ“– TLDR: Brief summary of the paper.

Regarding the πŸ”‘ Key:

KeyDefinition
modelIndicates a newly trained model.
frameworkIf the paper proposes a new agent framework.
datasetIf a new (training) dataset is created and published.
benchmarkIf a new benchmark is established (also add "dataset" if there's a new training set).
primary studiesList the main focus or innovation in the study.
AbbreviationsInclude commonly used abbreviations associated with the paper (model names, framework names, etc.).

For missing information, use "Unknown."

</details>