Home

Awesome

Awesome Large Multimodal Agents

Last update: 09/25/2024

<img src="./img/time.png" width="96%" height="96%">

<font size=5><center><b> Table of Contents </b> </center></font>


Papers

Taxonomy

<img src="./img/table.png" width="96%" height="96%">

Type Ⅰ

Type Ⅱ

Type Ⅲ

Type Ⅳ

Multi-Agent

Application

<img src="./img/app.png" width="96%" height="96%">

💡 Complex Visual Reasoning Tasks

🎵 Audio Editing & Generation

🤖 Embodied AI & Robotics

🖱️💻 UI-assistants

🎨 Visual Generation & Editing

🎥 Video Understanding

🚗 Autonomous Driving

🎮 Game-developer

Other

Benchmark