<div align="center"> <h1>Awesome AI Tools</h1> <a href="https://awesome.re"><img src="https://awesome.re/badge.svg"/></a> </div>

English | 中文

This repo collects AI-related utilities.

<a href="https://www.buymeacoffee.com/ikaijuaawesomeaitools" target="_blank"><img src="https://cdn.buymeacoffee.com/buttons/default-orange.png" alt="Buy Me A Coffee" height="41" width="174"></a>

All Categories

ChatGPT and other closed-source LLMs

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| ChatGPT | OpenAI's ChatGPT | URL | Free/Paid |
| Claude | Anthropic's AI assistant | URL | Free/Paid |
| Gemini | Google's conversational AI chat service and its latest LLM family, including Gemini Nano, Gemini Pro and Gemini Ultra. Gemini Pro is open for API and SDK use. Gemini is built from the ground up for multimodality, reasoning seamlessly across text, images, video, audio, and code. | URL <br> dev: URL | Free |
| Grok | xAI's AI assistant | URL | Free |
| Microsoft Copilot | Microsoft's AI assistant | URL | Free |
| Le Chat | Mistral AI's conversational AI chat service | URL | Free |

AI Search engine

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| Perplexity.ai | AI-driven conversational search engine. | URL | Free |
| You.com | A search engine in conversation mode | URL | Free |

Open Source LLMs

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| Llama 3 | Llama 3 is a large language model developed by Meta AI. It is the successor to Meta's Llama 2 language model. <br>Online demo:<br>huggingface.co/Meta-Llama-3-70B-Instruct | GitHub | Free |
| Mixtral | Mixtral 8x7B, a high-quality sparse mixture of experts model (SMoE) with open weights. Mixtral outperforms Llama 2 70B on most benchmarks with 6x faster inference. It matches or outperforms GPT-3.5 on most standard benchmarks. <br>paper: https://arxiv.org/pdf/2401.04088.pdf <br>news: https://mistral.ai/news/mixtral-of-experts/ | mistral-inference <br> mistral-finetune | Free |
| grok-1 | A large language model open sourced by xAI | GitHub | Free |
| Phi-3 | Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks. | GitHub | Free |

GPT LLMs Applications

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| Poe | AI product built by Quora. Can use ChatGPT, Sage, Dragonfly, and Claude bots for free. All you need is an email address to register. GPT-4 can be used once a day for free. | URL | Free, with paid upgrades |
| HuggingChat | Open source codebase powering the HuggingChat app. | URL, GitHub | Free |
| Google AI Studio | Google AI Studio is a free, web-based developer tool that enables you to quickly develop prompts and then get an API key to use in your app development. | URL | Free |
| NotebookLM | AI research assistant developed by Google. Upload PDFs, websites, YouTube videos, audio files, Google Docs, or Google Slides, and NotebookLM will summarize them and make interesting connections between topics. The Audio Overview feature can turn your sources into engaging “Deep Dive” discussions with one click. | URL | Free |
| Learn about | AI learning assistant developed by Google. Grasp new topics and deepen your understanding with a conversational learning companion that adapts to your unique curiosity and learning goals. | URL | Free |
| monica | AI assistant that helps with a variety of tasks such as searching, reading, writing, translating, drawing, and more. Standalone apps and browser plug-ins available. | URL <br> chrome extension | Free, with paid upgrades |
| ollama | Get up and running with Llama 2, Mistral, Gemma, and other large language models. | GitHub | Free |
| openai/openai-python | The official Python library for the OpenAI API. It is generated from an OpenAPI specification with Stainless. | GitHub | Free, requires an OpenAI API key |
| sashabaranov/go-openai | This library provides unofficial Go clients for the OpenAI API. Supports ChatGPT, GPT-3, GPT-4, and DALL·E 2. | GitHub | Free |
| langchain | LangChain is a framework for developing applications powered by language models. | GitHub | Free |
| Helicone AI | Helicone is the open-source LLM observability platform for logging, monitoring, and debugging AI applications. | GitHub | Free |
| ChatGPT-Next-Web | One click to get a well-designed cross-platform ChatGPT web UI, with GPT3, GPT4 & Gemini Pro support. | GitHub | Free |
| screenshot-to-code | This simple app converts a screenshot to HTML/Tailwind CSS. It uses GPT-4 Vision to generate the code and DALL-E 3 to generate similar-looking images. You can now also enter a URL to clone a live website! | GitHub | Free, requires access to GPT-4 Vision |
| Chatbox | Desktop application for the ChatGPT API (OpenAI API) that stores all chat messages and prompts locally, reducing the risk of data loss. A bit more stable than the web version. | GitHub | Free, requires an OpenAI API key |
| gpt-crawler | Crawl a site to generate knowledge files to create your own custom GPT from a URL. | GitHub | Free |
| ChatGPT-Shortcut | Open source ChatGPT shortcut commands that double productivity. Prompts are organized by domain and function and can be filtered by tag, searched by keyword, and copied with one click. | GitHub | Free |
| ChatGPT Sidebar | ChatGPT Sidebar is an artificial intelligence assistant you can use while browsing any website. | URL | Free |
| WebChatGPT | Open source. Adds web access to ChatGPT. | GitHub | Free |
| AIPRM for ChatGPT | Browser plug-in that provides a selection of curated ChatGPT prompt templates, lets you create your own, and lets you adjust the AI's tone and writing style. | URL | Free |
| GPTCache | ⚡ GPTCache is a library for creating a semantic cache to store responses from LLM queries. It can be used to speed up and lower the cost of chat applications that rely on an LLM service, playing a role similar to Redis in AIGC scenarios. | GitHub | Free |
| MindMac | Feature-rich & privacy-first native ChatGPT app for macOS to use OpenAI, Azure OpenAI, Anthropic Claude, and OpenRouter all in one place, designed for maximum productivity. Currently available in 15 languages. | URL | Free, with paid upgrades |
| MemFree | Open source hybrid AI search engine. Instantly get accurate answers from the Internet, bookmarks, notes, and docs. Supports one-click deployment. | GitHub | Free, supports one-click self-hosting |

AI Image Creation

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| Midjourney | Enter text or pictures to create pictures | URL | Free accounts have a usage-minute limit; paid upgrades available |
| Photoshop AI | Adobe Photoshop generative fill | URL | Paid |
| Stable diffusion webui | Open source project. Enter text or pictures to create pictures. Stable Diffusion web UI is a graphical user interface for Stable Diffusion, and it also integrates many other useful extension scripts. | GitHub | Free |
| civitai | civitai.com is a platform for sharing AI image creation model resources. With its large number of models, it has become the main model exchange in the SD open source community. | URL | Free |
| clipdrop | Clipdrop by stability.ai. Offers many AI image processing tools, such as Stable Diffusion XL, Uncrop, Reimagine XL, and Stable Doodle. | URL | Free/Paid |
| firefly | Adobe's AI image processing website | URL | Free/Paid |
| ideogram.ai | Enter text to create pictures. A product developed by a company founded by many ex-Googlers. | URL | Free/Paid |
| Skybox AI | Generate 360-degree panoramic images using text prompts | URL | Free/Paid |
| DragGAN | Interactive Point-based Manipulation on the Generative Image Manifold | GitHub | Free |
| visual-chatgpt | Create images with ChatGPT | GitHub | Free |
| Microsoft Bing Image Creator | Image Creator is a tool for creating pictures using DALL-E technology. In our tests, generated portraits looked poor. | URL | Free |
| remove.bg | Remove image backgrounds | URL | Free/Paid |
| ControlNet | ControlNet is a neural network structure to control diffusion models by adding extra conditions. | GitHub | Free |
| StreamDiffusion | A Pipeline-Level Solution for Real-Time Interactive Generation | GitHub | Free |

Video Creation

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| Sora | Sora is an AI model published by OpenAI that can create realistic and imaginative scenes from text instructions. | URL | Paid |
| KLING AI | AI video creation tool by Kuaishou. | URL | Free/Paid |
| hailuoai | AI video creation tool by MiniMax | URL | Free/Paid |
| Dream Machine | By Luma AI. Dream Machine is an AI model that makes high quality, realistic videos fast from text and images. Official introductory video | URL | Free/Paid |
| capcut | Text-to-speech from subtitles, speech recognition, and convenient, powerful video editing | URL | Free/Paid |
| Runway | Gen-2: Text/Image to video <br> Gen-1: Video to video. Featured video: https://runwayml.com/staff-picks | URL | Paid/Free trial |
| Pika | Text/Image to video | URL | Paid/Free trial |
| Fliki | A website that converts text into audio and video | URL | Free/Paid |
| d-id | Generate dubbed digital-human videos from text | URL | Paid/Free trial |
| HeyGen | Generate dubbed digital-human videos from text | URL | Paid/Free trial |
| AnimateDiff | AnimateDiff is a plug-and-play module turning most community models into animation generators, without the need of additional training. | GitHub | Free |
| vivago.ai/video | Text to Video; Image to Video; 4K enhance | URL | Free |

AI Cloud Platform

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| together.ai | The AI Acceleration Cloud. Train, fine-tune, and run inference on AI models blazing fast, at low cost, and at production scale. | URL | Free/Paid |

LLM Prompts

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| f/awesome-chatgpt-prompts | A curated collection of ChatGPT prompts for getting more out of ChatGPT. | GitHub | Free |

LLM Leaderboard

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| LMSYS Chatbot Arena Leaderboard | LMSYS Chatbot Arena is a crowdsourced open platform for LLM evals. It has collected over 1,000,000 human pairwise comparisons to rank LLMs with the Bradley-Terry model and displays the model ratings on an Elo scale. | URL | Free |
| Artificial Analysis | Artificial Analysis is a platform that provides AI model and service provider comparisons and benchmarks to help users make informed decisions when choosing AI models and service providers. The platform provides comparative data on a wide range of popular AI models, including OpenAI's GPT-4, Meta's Llama 3, and Anthropic's Claude series, covering performance metrics such as response time, latency, and cost. | URL | Free |
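
For readers curious how pairwise votes become a leaderboard, here is a minimal sketch of fitting a Bradley-Terry model and mapping the result to an Elo-like scale. It is not the Chatbot Arena implementation: the function names, the iterative update used for fitting, the 400-point scale, the 1000-point base rating, and the toy win counts are all assumptions chosen for illustration. It also assumes every model has at least one win and that all models are connected through comparisons.

```python
import math


def fit_bradley_terry(wins, iterations=500):
    """Estimate Bradley-Terry strengths from a matrix of pairwise wins.

    wins[i][j] is the number of times model i beat model j.
    Uses the classic iterative (minorization-maximization) update.
    """
    n = len(wins)
    strengths = [1.0] * n
    for _ in range(iterations):
        updated = []
        for i in range(n):
            total_wins = sum(wins[i][j] for j in range(n) if j != i)
            denom = sum(
                (wins[i][j] + wins[j][i]) / (strengths[i] + strengths[j])
                for j in range(n)
                if j != i
            )
            updated.append(total_wins / denom if denom > 0 else strengths[i])
        # Strengths are only defined up to a constant factor, so rescale.
        total = sum(updated)
        strengths = [s / total for s in updated]
    return strengths


def to_elo_scale(strengths, scale=400.0, base_rating=1000.0):
    """Map strengths to an Elo-like scale whose average is base_rating."""
    logs = [math.log10(s) for s in strengths]
    mean_log = sum(logs) / len(logs)
    return [base_rating + scale * (value - mean_log) for value in logs]


# Hypothetical vote counts for three models (A, B, C).
wins = [
    [0, 7, 8],  # A beat B 7 times and C 8 times
    [3, 0, 6],  # B beat A 3 times and C 6 times
    [2, 4, 0],  # C beat A 2 times and B 4 times
]
strengths = fit_bradley_terry(wins)
ratings = to_elo_scale(strengths)
for name, rating in zip("ABC", ratings):
    print(f"{name}: {rating:.0f}")
```

With this mapping, the usual Elo win-probability formula applied to two ratings gives the same value as the Bradley-Terry probability `s_i / (s_i + s_j)`, which is why the two views of the leaderboard are interchangeable.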

LLM training platform

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| lm-sys/FastChat | An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. | GitHub | Free |

Applications that integrate multiple LLMs

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| chathub | Use different chatbots in one app, currently supporting ChatGPT, new Bing Chat, Google Bard, Claude, and 10+ open-source models including Alpaca, Vicuna, ChatGLM etc. | GitHub | Free/Paid |
| ChatALL | Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, Claude, ChatGLM, MOSS, and more, and discover the best answers. | GitHub | Free |
| Harbor | Effortlessly run LLM backends, APIs, frontends, and services with one command. | GitHub | Free |

AI Agent

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| Auto-GPT | An experimental open-source attempt to make GPT-4 fully autonomous. | GitHub | Free |
| OthersideAI/self-operating-computer | A framework to enable multimodal models to operate a computer. | GitHub | Free, GPT-4V required |
| AppAgent | Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps. | GitHub | Free |
| microsoft/autogen | AutoGen is an open-source programming framework for building AI agents and facilitating cooperation among multiple agents to solve tasks. | GitHub | Free |
| potpie-ai/potpie | Open source AI agents for your codebase in minutes. Use pre-built agents for Q&A, testing, debugging, and system design, or create your own purpose-built agents. | URL, GitHub | Free Trial |
| saplings | A framework for building agents that use search algorithms to complete tasks. | GitHub | Free |

Writing

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| Notion AI | AI-assisted note-taking software | URL | Limited free AI trial; AI features $10/month |
| DeepL Write | English and German writing tool that fixes writing errors and rewrites sentences promptly. | URL | Free version with a text length limit / paid upgrade available |
| grammarly | Edit and correct your grammar, spelling, punctuation, and more with your personal writing assistant, grammar checker, and editor. | URL | Free/Paid |
| TextCraft | Add-in for Microsoft Word that seamlessly integrates essential AI tools, including text generation, proofreading, and more, directly into the user interface. | URL | Free |

Programming Development

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| GitHub Copilot | A code writing assistant developed by GitHub and OpenAI | URL | Paid |
| Cursor | A collaborative code editor using GPT | URL | Paid/Free Trial |
| MarsCode | Built-in AI programming assistant with capabilities like code completion, explanation, and debugging for faster development. | URL | Free |
| ai-code-translator | Open source project. Translates code from one language to another using ChatGPT. | GitHub | Free, requires OpenAI API key |
| Amazon CodeWhisperer | A code writing assistant developed by Amazon | URL | Free for Individual Use |
| gpt-engineer | GPT Engineer is made to be easy to adapt, extend, and make your agent learn how you want your code to look. It generates an entire codebase based on a prompt. | GitHub | Free |
| Codeium | Powerful in-IDE AI coding assistant | URL | Free/Paid |
| scalene | Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals | GitHub | Free |
| Fitten Code | Fitten Code is an AI programming assistant driven by Fitten LLM models, which can automatically generate code, improve development efficiency, help you debug, and save you time. It can also chat with you and solve your programming problems. It is free and supports over 80 languages: Python, C++, JavaScript, TypeScript, Java, etc. Fitten Code supports Visual Studio Code and JetBrains series IDEs, including IntelliJ IDEA, PyCharm, WebStorm, etc. | URL | Free |
| flappy | Production-Ready LLM Agent SDK for Every Developer | GitHub | Free |
| Plandex | Open source, terminal-based AI programming engine for complex tasks | GitHub | Free |
| Mistral/Codestral | Empowering developers and democratising coding with Mistral AI. <br>models: https://huggingface.co/mistralai/Codestral-22B-v0.1 | URL | Free |

Translation

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| immersive-translate | Open source project. Immersive bilingual web translation extension | GitHub | Free |
| DeepL | Accurate and instant translation tool, currently supporting 31 languages | URL | Free/Paid |
| openai-translator | Open source project. Text-selection translation browser plugin and cross-platform desktop application based on the ChatGPT API | GitHub | Free, requires OpenAI API key |

AI Conversation or AI Voice Conversation

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| pi.ai | An AI that is very good at conversation and can keep chatting all day. It supports both text and speech. Voice input relies on Apple's input system. Good for practicing English conversation and listening. | URL | Free |
| Voice Control for ChatGPT | This Chrome extension allows you to have voice conversations with ChatGPT. | URL | Free, requires ChatGPT account |
| SpeechGPT | SpeechGPT is a web application that enables you to converse with ChatGPT. | GitHub | Free, requires OpenAI API key |

Speech Recognition

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| whisper | OpenAI's open source model for robust speech recognition via large-scale weak supervision | GitHub | Free |
| buzz | Open source desktop software based on OpenAI's Whisper that recognizes speech and generates subtitles | GitHub | Free |
| WhisperDesktop | Open source desktop application for Windows based on OpenAI's Whisper. It runs on the GPU, which is faster than the CPU when the GPU is capable. | GitHub | Free |
| whisperX | WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) | whisperX | Free |
| whisper-web | ML-powered speech recognition directly in your browser. Built with Transformers.js. Demo | GitHub | Free |

Text To Speech

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| Azure Text to speech | Among the best and most realistic voice tools currently available | URL | Paid / 500,000 characters per month free |
| coqui-ai/tts | A deep learning toolkit for Text-to-Speech, battle-tested in research and production <br> Online Demo: https://huggingface.co/spaces/coqui/xtts | GitHub | Free |
| elevenlabs | Intelligent AI Text to Speech | URL | Free/Paid |
| netease-youdao/EmotiVoice | A Multi-Voice and Prompt-Controlled TTS Engine. EmotiVoice speaks both English and Chinese, with over 2000 different voices. Its most prominent feature is emotional synthesis, allowing you to create speech with a wide range of emotions, including happy, excited, sad, angry and others. | GitHub | Free |
| tetos | A unified interface for multiple Text-to-Speech (TTS) providers. Supported TTS providers: Edge TTS, OpenAI TTS, Azure TTS, Google TTS, Volcengine TTS, Baidu TTS | GitHub | Free |
| ChatTTS | ChatTTS is a text-to-speech model designed specifically for dialogue scenarios such as LLM assistants. It supports both English and Chinese. The model is trained on 100,000+ hours of Chinese and English data. Website: https://chattts.com/ | GitHub | Free |

Music Recognition

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| shazam | Download the Shazam app for music recognition, which is pretty fast | URL | Free |

Voice Processing

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| so-vits-svc | SoftVC VITS Singing Voice Conversion. | GitHub | Free |
| vocalremover | Extract vocals and music | URL | Free |
| lala.ai | Extract vocals, accompaniment and various instruments from any audio and video | URL | Free/Paid |

AI generated music or sound effects

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| suno.ai | The AI music creation tool Suno can generate custom songs based on text prompts in mere seconds. You can create your own AI songs with this new Copilot extension. | URL |  |
| udio | Create music from simple text prompts by specifying topics, genres, and other descriptors which are then transformed into professional quality tracks. | URL |  |
| elevenlabs/sound-effects | Imagine a sound and bring it to life, or explore a selection of the best sound effects generated by the community. | URL | Free |
| suno-ai/bark | Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio, including music, background noise and simple sound effects. | GitHub | Free |
| audiocraft | Open source library for audio/music generation by Meta, which mainly includes two models: MusicGen, a text-to-music model, and AudioGen, a text-to-sound model. MusicGen Online Demo | GitHub | Free |
| Stable Audio | AI music and sound effect generation application by stability.ai | URL | Free/Paid |
| OptimizerAI | Sound effect generation <br>Official Introduction | URL | Free/Paid |
| SFX Engine | AI sound effect generation | URL | Free/Paid |

Speech translation

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| Seamless | Seamless is a family of AI models that enable more natural and authentic communication across languages. Online Demo | GitHub | Free |

Video Content Summary

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| ChatGPT for YouTube | Chrome plugin that quickly summarizes YouTube video content. Requires logging in to a ChatGPT account or providing an API key. | URL | Free |
| Chat Youtube | Give it a YouTube link and it returns a summary; you can then ask it questions about the content of the video. | URL | Free |

OCR

| Name | Description | Links | Fees |
| --- | --- | --- | --- |
| Umi-OCR | Comes with a highly efficient offline OCR engine. As long as the computer performance is sufficient, it can be faster than online OCR services. | GitHub | Free |

Star History

Awesome-AITools Discord Link: https://discord.gg/7hAvJQME