Home

Awesome

<img src="https://storage.googleapis.com/assistly/static/realchar/realchar.svg" height="24px" style="padding-top:4px"/>RealChar. - Your Realtime AI Character

<br/> <div align="center"> <img src="https://storage.googleapis.com/assistly/static/realchar/logo.png" alt="RealChar-logo" width="80%" style="padding: 40px"/> </div> <br/> <p align="center"> ๐ŸŽ™๏ธ๐Ÿค–<em>Create, customize and talk to your AI Character/Companion in realtime</em>๐ŸŽ™๏ธ๐Ÿค– </p> <div align="center"> <a href="https://realchar.ai/join-discord"> <img src="https://img.shields.io/badge/discord-join%20chat-blue.svg?style=for-the-badge" alt="Join our Discord" height="20"> </a> <a href="https://twitter.com/agishaun"> <img alt="Twitter Follow" src="https://img.shields.io/twitter/follow/agishaun?style=for-the-badge" height="20"> <a href="https://github.com/Shaunwei/RealChar"> <img alt="GitHub" src="https://img.shields.io/github/stars/Shaunwei/RealChar?style=for-the-badge&color=gold" height="20"> </a> <a href="https://github.com/Shaunwei/RealChar/commits/main"> <img alt="GitHub" src="https://img.shields.io/github/last-commit/Shaunwei/RealChar/main?style=for-the-badge" height="20"> </a> <a href="https://github.com/Shaunwei/RealChar/blob/main/README.md" target="_blank"> <img src="https://img.shields.io/static/v1?label=license&message=MIT&color=green&style=for-the-badge" alt="License" height="20"> </a> <a href="https://hub.docker.com/repository/docker/shaunly/real_char/general" target="_blank"> <img alt="Docker Pulls" src="https://img.shields.io/docker/pulls/shaunly/real_char?style=for-the-badge" height="20"> </a> </div>

โœจ Demo

Try our site at RealChar.ai

Not sure how to pronounce RealChar? Listen to this ๐Ÿ‘‰ audip

Demo 1 - with Santa Claus!

https://github.com/Shaunwei/RealChar/assets/5101573/6b35a80e-5503-4850-973d-254039bd383c

Demo 2 - with AI Elon about cage fight!

https://github.com/Shaunwei/RealChar/assets/5101573/5de0b023-6cf3-4947-84cb-596f429d109e

Demo 3 - with AI Raiden about AI and "real" memory

https://github.com/Shaunwei/RealChar/assets/5101573/62a1f3d1-1166-4254-9119-97647be52c42

Demo settings: Web, GPT4, ElevenLabs with voice clone, Chroma, Google Speech to Text

๐ŸŽฏ Key Features

๐Ÿ”ฌ Tech stack

<div align="center"> <img src="https://storage.googleapis.com/assistly/static/realchar/techstackv004.jpg" alt="RealChar-tech-stack" width="100%" style="padding: 20px"/> </div>

๐Ÿ“š Comparison with existing products

<div align="center"> <img src="https://storage.googleapis.com/assistly/static/realchar/compare.png"> </div>

๐Ÿ“€ Quick Start - Installation via Docker

  1. Create a new .env file

    cp .env.example .env
    

    Paste your API keys in .env file. A single ReByte or OpenAI API key is enough to get started.

    You can also configure other API keys if you have them.

  2. Start the app with docker-compose.yaml

    docker compose up
    

    If you have issues with docker (especially on a non-Linux machine), please refer to https://docs.docker.com/get-docker/ (installation) and https://docs.docker.com/desktop/troubleshoot/overview/ (troubleshooting).

  3. Open http://localhost:3000 and enjoy the app!

๐Ÿ’ฟ Developers - Installation via Python

Note if you want to remotely connect to a RealChar server, SSL set up is required to establish the audio connection.

๐Ÿ‘จโ€๐Ÿš€ API Keys and Configurations

1. LLMs

1.1 ReByte API Key

To get your ReByte API key, follow these steps:

  1. Go to the ReByte website and sign up for an account if you haven't already.
  2. Once you're logged in, go to Settings > API Keys.
  3. Generate a new API key by clicking on the "Generate" button.

1.2 (Optional) OpenAI API Token

<details><summary>๐Ÿ‘‡click me</summary> This application utilizes the OpenAI API to access its powerful language model capabilities. In order to use the OpenAI API, you will need to obtain an API token.

To get your OpenAI API token, follow these steps:

  1. Go to the OpenAI website and sign up for an account if you haven't already.
  2. Once you're logged in, navigate to the API keys page.
  3. Generate a new API key by clicking on the "Create API Key" button.

(Optional) To use Azure OpenAI API instead, refer to the following section:

  1. Set API type in your .env file: OPENAI_API_TYPE=azure

If you want to use the earlier version 2023-03-15-preview:

OPENAI_API_VERSION=2023-03-15-preview

  1. To set the base URL for your Azure OpenAI resource. You can find this in the Azure portal under your Azure OpenAI resource.

OPENAI_API_BASE=https://your-base-url.openai.azure.com

  1. To set the OpenAI model deployment name for your Azure OpenAI resource.

OPENAI_API_MODEL_DEPLOYMENT_NAME=gpt-35-turbo-16k

  1. To set the OpenAIEmbeddings model deployment name for your Azure OpenAI resource.

OPENAI_API_EMBEDDING_DEPLOYMENT_NAME=text-embedding-ada-002

</details>

1.3 (Optional) Anthropic(Claude 2) API Token

<details><summary>๐Ÿ‘‡click me</summary>

To get your Anthropic API token, follow these steps:

  1. Go to the Anthropic website and sign up for an account if you haven't already.
  2. Once you're logged in, navigate to the API keys page.
  3. Generate a new API key by clicking on the "Create Key" button.
</details>

1.4 (Optional) Anyscale API Token

<details><summary>๐Ÿ‘‡click me</summary>

To get your Anyscale API token, follow these steps:

  1. Go to the Anyscale website and sign up for an account if you haven't already.
  2. Once you're logged in, navigate to the Credentials page.
  3. Generate a new API key by clicking on the "Generate credential" button.
</details>

2. Speech to Text

We support faster-whisper and whisperX as the local speech to text engines. Work with CPU and NVIDIA GPU.

2.1 (Optional) Google Speech-to-Text API

<details><summary>๐Ÿ‘‡click me</summary>

To get your Google Cloud API credentials.json, follow these steps:

  1. Go to the GCP website and sign up for an account if you haven't already.
  2. Follow the guide to create a project and enable Speech to Text API
  3. Put google_credentials.json in the root folder of this project. Check Create and delete service account keys
  4. Change SPEECH_TO_TEXT_USE to use GOOGLE in your .env file
</details>

2.2 (Optional) OpenAI Whisper API

<details><summary>๐Ÿ‘‡click me</summary>

Same as OpenAI API Token

</details>

3. Text to Speech

Edge TTS is the default and is free to use.

3.1 (Optional) ElevenLabs API Key

<details><summary>๐Ÿ‘‡click me</summary>
  1. Creating an ElevenLabs Account

    Visit ElevenLabs to create an account. You'll need this to access the text to speech and voice cloning features.

  2. In your Profile Setting, you can get an API Key.

</details>

3.2 (Optional) Google Text-to-Speech API

<details><summary>๐Ÿ‘‡click me</summary>

To get your Google Cloud API credentials.json, follow these steps:

  1. Go to the GCP website and sign up for an account if you haven't already.
  2. Follow the guide to create a project and enable Text to Speech API
  3. Put google_credentials.json in the root folder of this project. Check Create and delete service account keys
</details>

(Optional) ๐Ÿ”ฅ Create Your Own Characters

<details><summary>๐Ÿ‘‡click me</summary>

Create Characters Locally

see realtime_ai_character/character_catalog/README.md

Create Characters on ReByte.ai

see docs/rebyte_agent_clone_instructions.md

</details>

(Optional) โ˜Ž๏ธ Twilio Integration

<details><summary>๐Ÿ‘‡click me</summary>

To use Twilio with RealChar, you need to set up a Twilio account. Then, fill in the following environment variables in your .env file:

TWILIO_ACCOUNT_SID=YOUR_TWILIO_ACCOUNT_SID
TWILIO_ACCESS_TOKEN=YOUR_TWILIO_ACCESS_TOKEN
DEFAULT_CALLOUT_NUMBER=YOUR_PHONE_NUMBER

You'll also need to install torch and torchaudio to use Twilio.

Now, you can receive phone calls from your characters by typing /call YOURNUMBER in the text box when chatting with your character.

Note: only US phone numbers and Elevenlabs voiced characters are supported at the moment.

</details>

๐Ÿ†•! Anyscale and LangSmith integration

<details><summary>๐Ÿ‘‡click me</summary>

Anyscale

You can now use Anyscale Endpoint to serve Llama-2 models in your RealChar easily! Simply register an account with Anyscale Endpoint. Once you get the API key, set this environment variable in your .env file:

ANYSCALE_ENDPOINT_API_KEY=<your API Key>

By default, we show the largest servable Llama-2 model (70B) in the Web UI. You can change the model name (meta-llama/Llama-2-70b-chat-hf) to other models, e.g. 13b or 7b versions.

LangSmith

If you have access to LangSmith, you can edit these environment variables to enable:

LANGCHAIN_TRACING_V2=false # default off
LANGCHAIN_ENDPOINT=https://api.smith.langchain.com
LANGCHAIN_API_KEY=YOUR_LANGCHAIN_API_KEY
LANGCHAIN_PROJECT=YOUR_LANGCHAIN_PROJECT

And it should work out of the box.

</details> <br/>

๐Ÿ“ Roadmap

$*$ These features are powered by ReByte platform.

๐Ÿซถ Contribute to RealChar

Please check out our Contribution Guide!

๐Ÿ’ช Contributors

<a href="https://github.com/Shaunwei/RealChar"> <img src="https://contrib.rocks/image?repo=Shaunwei/RealChar" /> </a>

๐ŸŽฒ Community