
</div>

TEN, which stands for Transformative Extensions Network, is a voice agent framework for building conversational AI.

Features

The TEN framework offers the following advantages:

Native Support for High-Performance, Real-Time Multimodal Interactions

If your AI applications involve complex audio-visual scenarios, TEN is your go-to solution. It offers high performance and low latency, with extensive optimization of interactions between various extensions to ensure efficient development of AI applications.
Supports Multiple Languages and Platforms

Developers can create modular and reusable extensions using various programming languages, such as C++, Go, and Python (with future support for JavaScript/TypeScript). Moreover, the TEN framework runs seamlessly across platforms, including Windows, Mac, Linux, and mobile devices.
Edge-Cloud Integration

Through the TEN framework, extensions deployed across edge and cloud environments can be easily combined to create diverse applications and scenarios. For privacy-sensitive edge deployments, small models leverage local compute power for reduced costs and lower latency, while cloud-based large models can be integrated for an optimal balance of cost and performance.
Flexibility Beyond Model Limitations

The TEN framework allows for the creation of complex AI applications that transcend the limitations of large models alone. Agents can be easily constructed to meet a wide range of needs using a simple drag-and-drop, responsive programming approach. TEN also facilitates the integration of AI with audio-visual tools, databases, monitoring systems, RAG, and more.
Real-Time Agent State Management

TEN manages agent states in real time, so agent behavior can respond and adjust dynamically as a conversation unfolds.
And more...

For more information and detailed documentation on all the capabilities of the TEN framework, please refer to the TEN framework documentation site.
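
As a rough illustration of the edge-cloud idea described above, the following minimal sketch routes a request either to a small local model or to a large cloud model. It does not use the TEN framework API: `Request`, `edge_small_model`, `cloud_large_model`, `choose_target`, and the `max_edge_chars` threshold are all hypothetical names and assumptions made up for this example, and the two model functions are simple stand-ins.

```python
from dataclasses import dataclass
from typing import Callable, Dict

# Hypothetical sketch only: none of these names come from the TEN framework API.


@dataclass
class Request:
    text: str
    privacy_sensitive: bool = False


def edge_small_model(text: str) -> str:
    # Stand-in for a small model running on local (edge) compute.
    return f"[edge] {text}"


def cloud_large_model(text: str) -> str:
    # Stand-in for a large model served from the cloud.
    return f"[cloud] {text}"


def choose_target(request: Request, max_edge_chars: int = 80) -> str:
    """Return "edge" or "cloud" for a request.

    Privacy-sensitive requests always stay on the edge. For everything else,
    the length threshold is an assumed, illustrative cost/latency heuristic.
    """
    if request.privacy_sensitive:
        return "edge"
    return "edge" if len(request.text) <= max_edge_chars else "cloud"


HANDLERS: Dict[str, Callable[[str], str]] = {
    "edge": edge_small_model,
    "cloud": cloud_large_model,
}


def handle(request: Request) -> str:
    # Dispatch the request text to whichever handler the router selected.
    return HANDLERS[choose_target(request)](request.text)


if __name__ == "__main__":
    print(handle(Request("turn on the lights")))
    print(handle(Request("summarize my medical notes " * 10, privacy_sensitive=True)))
    print(handle(Request("write a detailed travel plan " * 10)))
```

In a real deployment the routing decision would more likely depend on measured latency, cost, and model capability than on text length; the threshold here only keeps the example self-contained.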

Use Cases

With the TEN framework, you can easily build the following scenarios. Working demos are available at TEN Agent:

Simultaneous interpretation

Translate speech in real time during live conversations, enabling smooth cross-language communication without delays.
Speech-to-text conversion

Convert spoken language into written text, which is useful for transcribing meetings, interviews, or live talks.
Multilingual chat rooms

Create chat rooms where users can communicate in different languages, with messages translated automatically in real time to foster seamless interaction.
Audio interaction

Enable users to communicate with the AI using audio instead of text, which is ideal for hands-free communication or enhancing accessibility.
Audio-visual interaction

Combine audio and visual elements to create interactive experiences, such as video conferences with integrated real-time transcription, translation, or even interactive media content.
And more...

The flexibility of the TEN framework enables developers to build additional interactive solutions, such as virtual assistants, automated customer support systems, and dynamic multimedia applications.

Documentation

Please visit our TEN framework documentation site for more information.

Stay Tuned

Before we get started, be sure to star our repository and get instant notifications for all new releases!


License Information

The entire TEN framework (except for the folders explicitly listed below) is released under the Apache License, Version 2.0, with additional restrictions. For details, please refer to the LICENSE file located in the root directory of the TEN framework.
The components within the packages directory are released under the Apache License, Version 2.0. For details, please refer to the LICENSE file located in each package's root directory.
The third-party libraries used by the TEN framework are listed and described in the dependencies.md file located in the docs/ten_framework directory.