Awesome
<p align="center"> <img alt="Midscene.js" width="260" src="https://github.com/user-attachments/assets/bff5e76f-ea5c-42b7-bd12-0143a04671cf"> </p> <h1 align="center">Midscene.js</h1> <div align="center"> </div> <p align="center"> Joyful UI Automation </p> <p align="center"> <img src="https://img.shields.io/npm/v/@midscene/web?style=flat-square&color=00a8f0" alt="npm version" /> <img src="https://img.shields.io/npm/dm/@midscene/web.svg?style=flat-square&color=00a8f0" alt="downloads" /> <img src="https://img.shields.io/badge/License-MIT-blue.svg?style=flat-square&color=00a8f0" alt="License" /> </p>Midscene.js is an AI-powered automation SDK can control the page, perform assertions, and extract data in JSON format using natural language.
<p align="center"> <video src="https://github.com/user-attachments/assets/c9f1e7fa-9834-45ba-8e3b-c3fdc1f7e5bb" controls /> </p>Features ✨
- Natural Language Interaction 👆: Describe the steps, and let Midscene plan and control the user interface for you
- Understand UI, Answer in JSON 🔍: Provide prompts regarding the desired data format, and then receive the expected response in JSON format.
- Intuitive Assertion 🤔: Make assertions in natural language; it’s all based on AI understanding.
- Out-of-box LLM 🪓: It is fine to use public multimodal LLMs like GPT-4o. There is no need for any custom training.
- Visualized Report 🎞️: With our visualized report file, you can easily understand and debug the whole process.
- Brand New Experience! 🔥: Experience a whole new world of automation development. Enjoy!
Resources 📄
- Home Page: https://midscenejs.com
- Quick Experience By Chrome Extension
- API Reference
- Automate with Scripts in YAML
- Integrate with Puppeteer
- Integrate with Playwright
- Customize Model and Provider
License
Midscene.js is MIT licensed.