TEN-framework/TEN-Agent
Fork: 319 Star: 3355 (updated 2024-12-18 12:29:30)
license: Apache-2.0
Language: Python
TEN Agent is a conversational AI powered by TEN, integrating Gemini 2.0 Multimodal Live API, OpenAI Realtime API, RTC, and more. It offers real-time capabilities to see, hear, and speak, along with advanced tools like weather checks, web search, and RAG.
Latest release: 0.6.2 (2024-12-09 14:00:55)
🌟 Gemini Multimodal Live API Extension with RTC
Try the Google Gemini Multimodal Live API with real-time vision and real-time screen-share detection. It is a ready-to-use extension, and powerful tools like Weather Check and Web Search are integrated into TEN Agent alongside it.
TEN Agent Usecases
Ready-to-use Extensions
TEN Agent Playground in Local Environment
Prerequisites
| Category | Requirements |
|---|---|
| Keys | • Agora App ID and App Certificate (free minutes every month) • OpenAI API key • Deepgram ASR (free credits available with signup) • FishAudio TTS (free credits available with signup) |
| Installation | • Docker / Docker Compose • Node.js (LTS) v18 |
| Minimum System Requirements | • CPU >= 2 cores • RAM >= 4 GB |
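The minimums in the table above can be checked before installing anything. The following is a minimal sketch, not part of TEN Agent: the function names are hypothetical, the executable names `docker` and `node` are assumed to be how the tools appear on your PATH, and RAM detection relies on `os.sysconf`, which is unavailable on some platforms (the helper returns `None` there).

```python
import os
import shutil

MIN_CPU_CORES = 2  # from the prerequisites table: CPU >= 2 cores
MIN_RAM_GB = 4     # from the prerequisites table: RAM >= 4 GB


def meets_minimum(cpu_cores: int, ram_gb: float) -> bool:
    """Return True when both minimums from the table are satisfied."""
    return cpu_cores >= MIN_CPU_CORES and ram_gb >= MIN_RAM_GB


def missing_tools(tools=("docker", "node")) -> list:
    """Return the names of required executables not found on PATH."""
    return [name for name in tools if shutil.which(name) is None]


def detect_ram_gb():
    """Best-effort total RAM in GiB; None where sysconf is unavailable."""
    try:
        pages = os.sysconf("SC_PHYS_PAGES")
        page_size = os.sysconf("SC_PAGE_SIZE")
    except (AttributeError, ValueError, OSError):
        return None
    return pages * page_size / (1024 ** 3)


if __name__ == "__main__":
    cores = os.cpu_count() or 0
    ram = detect_ram_gb()
    print("CPU cores:", cores, "| RAM (GiB):", ram)
    if ram is not None:
        print("Meets minimum:", meets_minimum(cores, ram))
    print("Missing tools:", missing_tools())
```

A machine sold as "4 GB" may report slightly less than 4 GiB, so treat a near miss on RAM as a hint rather than a hard failure.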
macOS: Docker setting on Apple Silicon
For Apple Silicon Macs, uncheck "Use Rosetta for x86/amd64 emulation" in Docker settings. Note: This may result in slower build times on ARM, but performance will be normal when deployed to x64 servers.
Next step
1. Create .env file
cp ./.env.example ./.env
2. Set up Agora App ID and App Certificate in .env
AGORA_APP_ID=
AGORA_APP_CERTIFICATE=
3. Start agent development containers
docker compose up -d
4. Enter container
docker exec -it ten_agent_dev bash
5. Build agent
task use
6. Start the web server
task run
7. Edit playground settings
Open the playground at localhost:3000 to configure your agent.
- Select a graph type (e.g. Voice Agent, Realtime Agent)
- Choose a corresponding module
- Select an extension and configure its API key settings
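Step 2 leaves `AGORA_APP_ID` and `AGORA_APP_CERTIFICATE` empty in `.env` until you fill them in, and it is easy to start the containers without doing so. As a hedged sketch (the helper names are hypothetical, and the parser handles only plain `KEY=VALUE` lines rather than the full syntax Docker Compose accepts), a small script can flag keys that are missing or blank:

```python
from pathlib import Path

# Key names taken from step 2 above.
REQUIRED_KEYS = ("AGORA_APP_ID", "AGORA_APP_CERTIFICATE")


def parse_env(text: str) -> dict:
    """Parse simple KEY=VALUE lines, skipping blanks and # comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(text: str, required=REQUIRED_KEYS) -> list:
    """Return required keys that are absent or left empty."""
    values = parse_env(text)
    return [key for key in required if not values.get(key)]


if __name__ == "__main__":
    env_path = Path(".env")
    if not env_path.exists():
        print("No .env found; copy .env.example first.")
    else:
        missing = missing_keys(env_path.read_text())
        print("Missing or empty:", missing if missing else "none")
```

Run it from the repository root after step 2 and before `docker compose up -d`.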
Running Gemini Realtime Extension
Open the playground at localhost:3000.
- Select voice_assistant_realtime graph
- Choose Gemini Realtime module
- Select v2v extension and enter Gemini API key
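If the playground page does not load, it can help to confirm that something is listening on port 3000 before debugging further. The sketch below is an illustration only, with hypothetical function names; it checks TCP reachability and says nothing about whether the playground itself is healthy.

```python
import socket
import time


def port_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def wait_for_port(host: str, port: int, attempts: int = 30, delay: float = 1.0) -> bool:
    """Poll until the port accepts connections or attempts run out."""
    for _ in range(attempts):
        if port_open(host, port):
            return True
        time.sleep(delay)
    return False
```

For example, calling `wait_for_port("localhost", 3000)` after `task run` polls for up to about 30 seconds and returns `True` once the port accepts connections.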
TEN Agent Components
Stay Tuned
Before we get started, be sure to star our repository and get instant notifications for all new releases!
Join Community
- Discord: Ideal for sharing your applications and engaging with the community.
- GitHub Discussion: Perfect for providing feedback and asking questions.
- GitHub Issues: Best for reporting bugs and proposing new features. Refer to our contribution guidelines for more details.
- X (@TenFramework): Great for sharing your agents and interacting with the community.
Star History
Code Contributors
Contribution Guidelines
Contributions are welcome! Please read the contribution guidelines first.
License
This project is licensed under the Apache 2.0 License - see the LICENSE file for details.
Recent releases (data updated 2024-12-18 15:59:10):
2024-12-09 14:00:55 0.6.2
2024-11-18 10:37:55 0.6.1
2024-10-10 22:52:37 0.5.0
2024-09-05 08:58:16 v0.4.1
2024-08-19 10:46:36 v0.4.0
2024-08-15 10:18:20 v0.3.0
2024-08-14 23:19:45 v0.3.0-rc1
2024-07-26 14:02:11 v0.2.0
Topics:
agent, ai, asr, cpp, gemini, golang, gpt-4, gpt-4o, llm, low-latency, multimodal, nextjs14, openai, python, rag, real-time, realtime, tts, vision, voice-assistant