Awesome

Awesome-LLM-based-Web-Agent-and-Tools

A collection of some awesome public projects about LLM-based Web Agents and Tools (Continuously Update...).

Awesome-LLM-based-Web-Agent-and-Tools

Overview of LLM-based Web Agent

Detailed Explanation

(The following section was automatically generated by ChatGPT)

An LLM-based Web Agent can accomplish a variety of text-based tasks such as writing and proofreading articles, analyzing snippets of code, or even conducting advanced dialogues. Its purposes can range from personal assistance (scheduling, reminders, and searching for information) to professional tasks (technical analysis, summarizing reports, acting as a customer service representative, among other things).

The benefits of using LLMs as Web Agents include:

They can operate 24/7, providing continuous support.
They can handle a large volume of requests and responses.
They do not require breaks and cannot suffer from fatigue.

Despite these benefits, it's crucial to remember that they can't replace human judgement in sensitive or critical matters. They are most suitable for automating repetitive tasks, assisting in information extraction and providing guidance on widely covered topics.

To design a Web Agent using a Large Language Model, the following steps may be considered:

Define the tasks: Clearly specify the tasks the LLM-based Web Agent should perform.
Data Gathering and Training: Gather the relevant data, and train the LLM to perform the tasks it needs to accomplish.
Integration: Integrate the trained model into your web application.
Testing and Iteration: Conduct regular testing and improve the model for better performance.
Monitoring and Updating: Constantly monitor the model's performance and regularly update or re-train the model to adapt to changes.

Papers

format:
- [title](paper link) [links]
  - author1, author2, and author3...
  - publisher
  - keyword
  - code
  - experiment environments and datasets

2024

SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents
- Kanzhi Cheng, Qiushi Sun, Yougang Chu, Fangzhi Xu, Yantao Li, Jianbing Zhang, Zhiyong Wu
- Keyword: Large Language Model, Visual GUI Agents
- Code: official

2023

2022

2021

Codebases

format:
- [title](codebase link) [links]
  - author1, author2, and author3...
  - keyword
  - experiment environments, datasets or tasks

Dataset (Benchmark)

format:
- [title](benchmark link) [links]
  - author1, author2, and author3...
  - keyword
  - experiment environments or tasks

Tools

format:
- [title](tool link) [links]
  - author1, author2, and author3...
  - keyword
  - experiment environments, datasets or tasks

2024

Llama3
- Meta
- Large Language Model
- Huggingface Site: official
Llama3-Chinese-Chat
- Shenzhi-Wang
- Large Language Model, Chinese chat fine-tuned
- Huggingface Site: official
Unichat-llama3-Chinese
- 中国联通AI创新中心
- Large Language Model, Chinese chat fine-tuned
- Huggingface Site: official
Skyvern
- Skyvern-AI
- Automate browser-based workflows with LLMs and Computer Vision
- Huggingface Site: official

Awesome

Awesome-LLM-based-Web-Agent-and-Tools

Table of Contents

Overview of LLM-based Web Agent

Detailed Explanation

Papers

2024

2023

2022

2021

Codebases

Dataset (Benchmark)

Tools

2024

2023

Blogs