Table of Contents
- 👋 Getting Started & Join TEN Community
- ✨ Features
- 💡 TEN Agent Usecases
- 🔌 Ready to use Extensions
- 🎮 TEN Agent Playground
- 🎥 Run Agent Demo
- ️🛳️ Deployment
- 🏗️ TEN Agent Architecture
- 🌍 TEN Framework Ecosystem
- 🤝 Contributing
TEN Agent is a conversational voice AI agent powered by TEN, integrating DeepSeek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables realtime AI capabilities like seeing, hearing, and speaking, and is fully compatible with platforms like Dify and Coze.
Important
Star Our Repository ⭐️
Get instant notifications for new releases and updates. Your support helps us grow and improve TEN Agent!
Star History
Build engaging AI avatars with TEN Agent using Trulience's diverse collection of free avatar options. To get it up and running, you only need 2 steps:
- Follow the README to finish setting up and running the Playground in
localhost:3000
- Enter the avatar ID and token you get from Trulience
TEN Agent + Trulience
TEN is a very versatile framework. That said, TEN Agent is compatible with DeepSeek R1, try experiencing realtime conversations with DeepSeek R1!
TEN Agent is now running on the Espressif ESP32-S3 Korvo V3 development board, an excellent way to integrate realtime communication with LLM on hardware.
Try Google Gemini Multimodal Live API with realtime vision and realtime screenshare detection capabilities, it is a ready-to-use extension, along with powerful tools like Weather Check and Web Search integrated perfectly into TEN Agent.
Gemini 2.0 Multimodal Live API
Describe a topic and ask TEN Agent to tell you a story while also generating images of the story to provide a more immersive experience for kids.
Storyteller + Image Generator
TEN offers a great support to make the realtime interactive experience even better on other LLM platform as well, check out docs for more.
TEN Agent + Dify with RAG
TEN offers a great support to make the realtime interactive experience even better on other LLM platform as well, check out docs for more.
Category | Requirements |
---|---|
Keys | • Agora App ID and App Certificate (free minutes every month) • OpenAI API key (any LLM that is compatible with OpenAI) • Deepgram ASR (free credits available with signup) • Elevenlabs TTS (free credits available with signup) |
Installation | • Docker / Docker Compose • Node.js(LTS) v18 |
Minimum System Requirements | • CPU >= 2 Core • RAM >= 4 GB |
Note
macOS: Docker setting on Apple Silicon
Uncheck "Use Rosetta for x86/amd64 emulation" in Docker settings, it may result in slower build times on ARM, but performance will be normal when deployed to x64 servers.
cp ./.env.example ./.env
AGORA_APP_ID=
AGORA_APP_CERTIFICATE=
docker compose up -d
docker exec -it ten_agent_dev bash
check the /examples
folder for more examples
task use
task run
- Open Up localhost:3000 and select a graph example
- Choose a corresponding module
- Select an extension and configure its API key and settings
Module Picker Example
Now, we have successfully set up the playground. This is just the beginning of TEN Agent. There are many different ways to explore and utilize TEN Agent. To learn more, please refer to the documentation.
GitHub offers free Codespace for each repository, you can run the playground in Codespace without using Docker.Also, the speed of Codespace is much faster than localhost.
Check out this guide for more details.
Playground and Demo server different purposes, in a nut shell, think it as Playground is for you to customize you agent, and Demo is for you to deploy your agent.
Check out this guide for more details.
Once you have customized your agent (either by using the playground or editing property.json
directly), you can deploy it by creating a release Docker image for your service.
Read the Deployment Guide for detailed information about deployment.
coming soon...
🏚️ TEN Framework TEN, a AI agent framework to create various AI agents which supports real-time conversation. |
![]() |
---|---|
🎙️ TEN Agent TEN Agent is a conversational voice AI agent powered by TEN, integrating Deepseek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables realtime AI capabilities like seeing, hearing, and speaking, and is fully compatible with platforms like Dify and Coze. |
![]() |
🎨 TMAN Designer alpha TMAN Designer is low/no code option to make a cool voice agent. With it's easy-to-use workflow UI, you can build things easily. It comes with runtime, dark/light themes, integrated editors and integrated terminals. |
![]() |
📒 TEN Portal The official site of TEN framework, it has documentation, blog and showcases. |
![]() |
We welcome all forms of open-source collaboration! Whether you're fixing bugs, adding features, improving documentation, or sharing ideas - your contributions help advance personalized AI tools. Check out our GitHub Issues and Projects to find ways to contribute and show your skills. Together, we can build something amazing! 🤝✨
Contributions are welcome! Please read the contribution guidelines first.
This project is Apache 2.0 licensed.