Home

Awesome

<div align="center"><a name="readme-top"></a> <a href="https://x.com/Lafe8088" target="_blank"> <img src="/docs/images/top.png" alt="alt text"> </a> <h1>QMedia</h1> <h3> AI content search engine designed specifically for content creators. <br /> </h3> <div style="text-align: center;">

English | 简体中文

Changelog - Report Issues - Request Feature

Twitter <a href="https://x.com/LinkLin1987"><img src="https://img.shields.io/badge/Follow-%40LinkLin-1DA1F2?logo=twitter&style={style}"></a> License: MIT <a href="https://discord.gg/bkU2K7GjAb"><img src="https://img.shields.io/discord/1245752894389489704?style=social&logo=discord"></a>

</div> <div align="left">

Key Features

<details open="open"> <summary>Directory</summary> <!-- ============================================ -->

👋🏻 Introduction

QMedia is an open-source multimedia AI content search engine , provides rich information extraction methods for text/image and short video content. It integrates unstructured text/image and short video information to build a multimodal RAG content Q&A system. The aim is to share and exchange ideas on AI content creation in an open-source manner. issues

Share QMedia with your friends.

Spark new ideas for content creation

<div align="center"> <a href="https://discord.gg/bkU2K7GjAb"><img src="https://img.shields.io/discord/1245752894389489704?style=social&logo=discord"></a> </div>Join our Discord community!
alt textJoin our WeChat group !
<br/> <div align="right">

</div> <!-- ============================================ -->

💫 Feature Overview

<div align="right">

</div> <!-- ============================================ -->

Future Plans

<div align="right">

</div>
<!-- ============================================ -->

🤖 Installation

File Structure Introduction

QMedia services: Depending on resource availability, they can be deployed locally or the model services can be deployed in the cloud

mm_server Installation

mmrag_server Installation

qmedia_web Installation

<div align="right">

</div>

⭐️ Usage

Combined Usage

mm_server + qmedia_web + mmrag_server Web Page Content Display, Content RAG Search and Q&A, Model Service

  1. Service Startup Process:
# Start mm_server service
cd mm_server
source activate qllm
python main.py

# Start mmrag_server service
cd mmrag_server
source activate qmedia
python main.py

# Start qmedia_web service
cd qmedia_web
pnpm dev
  1. Using Functions via the Web Page During the startup phase, mmrag_server will read pseudo data from assets/medias and assets/mm_pseudo_data.json, and call mm_server to extract and structure the information from text/image and short videos into node information, which is then stored in the db. The retrieval and Q&A will be based on the data in the db.

Custom Data

# assets file structure
assets
├── mm_pseudo_data.json # Content card data
└── medias # Image/Video files

Replace the contents in assets and delete the historically stored db file. assets/medias contains image/video files, which can be replaced with your own image/video files. assets/mm_pseudo_data.json contains content card data, which can be replaced with your own content card data. After running the service, the model will automatically extract the information and store it in the db.

<div align="right">

</div>

Independent Model Service

Can use the mm_server local image/text/video information extraction service independently. It can be used as a standalone image encoding, text encoding, video transcription extraction, and image OCR service, accessible via API in any scenario.

# Start mm_server service independently
cd mm_server
python main.py

# uvicorn main:app --reload --host localhost --port 50110

API Content:

alt text

<br/>

Pure Python RAG Service/Model Service

Can use mm_server + qmedia_web together to perform content extraction and RAG retrieval in a pure Python environment via APIs.

# Start mmrag_server service independently
cd mmrag_server
python main.py

# uvicorn main:app --reload --host localhost --port 50110

API Content:

alt text

<div align="right">

</div>

Star History

Star History Chart

License

QMedia is licensed under MIT License

Acknowledgments

Thanks to QAnything for strong OCR models.

Thanks to llava-llama3 for strong llm vision models.