From d1c144591ba0d511917ff82b88cae48a63187759 Mon Sep 17 00:00:00 2001
From: Ethan Zhang
Date: Thu, 9 Jan 2025 13:48:02 +0800
Subject: [PATCH] feat: add doc support for story teller (#528)

---
 docs/SUMMARY.md                               |  9 +--
 .../ten_agent/playground/run_story_telling.md | 64 +++++++++++++++++++
 docs/ten_agent/playground/run_va.md           | 47 ++++++++++++--
 docs/ten_agent/playground/run_va_realtime.md  | 34 ----------
 4 files changed, 111 insertions(+), 43 deletions(-)
 create mode 100644 docs/ten_agent/playground/run_story_telling.md
 delete mode 100644 docs/ten_agent/playground/run_va_realtime.md

diff --git a/docs/SUMMARY.md b/docs/SUMMARY.md
index 661e1b1d5a..f1d9ecf1ab 100644
--- a/docs/SUMMARY.md
+++ b/docs/SUMMARY.md
@@ -21,10 +21,11 @@
 * [Run Playground](ten_agent/playground/quickstart.md)
   * [Configure Modules](ten_agent/playground/configure_modules.md)
   * [Configure Properties](ten_agent/playground/configure_properties.md)
-  * [Run Voice Assistant](ten_agent/playground/run_va.md)
-  * [Run Voice Assistant with Realtime API](ten_agent/playground/run_va_realtime.md)
-  * [Run Dify Chat Bot](ten_agent/playground/run_dify.md)
-  * [Run Coze Chat Bot](ten_agent/playground/run_coze.md)
+  * Use Cases
+    * [Voice Assistant](ten_agent/playground/run_va.md)
+    * [Dify Chat Bot](ten_agent/playground/run_dify.md)
+    * [Coze Chat Bot](ten_agent/playground/run_coze.md)
+    * [Story Teller](ten_agent/playground/run_story_telling.md)
   * [Change Language](ten_agent/playground/change_language.md)
 * [Customize TEN Agent](ten_agent/customize_your_agent.md)
 * [Create a Hello World Extension](ten_agent/create_a_hello_world_extension.md)
diff --git a/docs/ten_agent/playground/run_story_telling.md b/docs/ten_agent/playground/run_story_telling.md
new file mode 100644
index 0000000000..4d7fe0ec8f
--- /dev/null
+++ b/docs/ten_agent/playground/run_story_telling.md
@@ -0,0 +1,64 @@
+# Run Story Teller with Large Language Model
+
+This guide will help you run the Story Teller use case with a Large Language Model in the TEN-Agent Playground.
+
+## STT + TTS + LLM
+
+### Prerequisites
+
+- Make sure you have the TEN-Agent playground running. If not, follow the [Run Playground](https://doc.theten.ai/ten-agent/quickstart) guide to start the playground.
+- You will need the following information prepared:
+  - STT info. Any supported STT can be used. [Deepgram](https://deepgram.com/) is relatively easy to register and get started with.
+  - TTS info. Any supported TTS can be used. [Fish.Audio](https://fish.audio/) is relatively easy to register and get started with.
+  - LLM info. For this use case, only [OpenAI](https://openai.com) or OpenAI API compatible models are supported.
+  - RTC info. Currently only Agora RTC is supported. You can register your account at [Agora](https://www.agora.io/). We assume you have your App ID and App Certificate ready when you configure your `.env` file.
+
+### Steps
+
+1. Open the playground at [localhost:3000](http://localhost:3000) to configure your agent.
+2. Select the graph type `story_teller`.
+3. Click on `Module Picker` to open the module selection.
+4. If your preferred STT/TTS module is not selected by default, select it from the dropdown list. Note that you will need to configure the module with the correct information, such as the API key.
+5. The `LLM` module is preconfigured to have `OpenAI ChatGPT` selected; don't change it.
+6. Click on `Save Change` to apply the module to the graph.
+7. Click on the button to the right of the graph selection to open the property configuration. You will see a list of properties that can be configured for the selected Large Language Model.
+8. Configure the properties with the information you prepared.
+9. Click on `Save Change` to apply the properties to the Large Language Model.
+10. If you see the success toast, the properties have been successfully applied to the Large Language Model.
+11. You are all set! Now you can start speaking to the Voice Assistant by clicking on the `Connect` button. Note that you will need to wait a few seconds for the agent to initialize itself.
+
+### Using Azure STT
+
+Azure STT is integrated within the RTC extension module. If you want to use Azure STT, you will therefore need to select the `story_teller_integrated_stt` graph type.
+
+### Bind Tools
+
+The `story_teller` use case is preconfigured to use `openai_image_generate_tool`, so you usually don't need to change anything.
+
+
+## Realtime V2V
+
+
+### Prerequisites
+
+- Make sure you have the TEN-Agent playground running. If not, follow the [Run Playground](https://doc.theten.ai/ten-agent/quickstart) guide to start the playground.
+- You will need the following information prepared:
+  - Realtime API Key
+- RTC info. Currently only Agora RTC is supported. You can register your account at [Agora](https://www.agora.io/). We assume you have your App ID and App Certificate ready when you configure your `.env` file.
+
+### Steps
+
+1. Open the playground at [localhost:3000](http://localhost:3000) to configure your agent.
+2. Select the graph type `story_teller_realtime`.
+3. Click on `Module Picker` to open the module selection.
+4. The `V2V` module is preconfigured to have `OpenAI Realtime` selected. You can select other V2V modules from the dropdown list if needed. Note that you will need to copy the `prompt` property from the `OpenAI Realtime` module to the new module, as module properties are reset to their defaults when switching.
+5. If you changed the V2V module, click on `Save Change` to apply the module to the graph; otherwise, skip this step.
+6. Click on the button to the right of the graph selection to open the property configuration. You will see a list of properties that can be configured for the selected V2V module.
+7. Configure the `Realtime API Key` property with the information you prepared. If you changed the V2V module in the previous steps, remember to copy the `prompt` property from the `OpenAI Realtime` module to the new module.
+8. Click on `Save Change` to apply the property to the V2V module.
+9. If you see the success toast, the property has been successfully applied to the V2V module.
+10. You are all set! Now you can start speaking to the Voice Assistant by clicking on the `Connect` button. Note that you will need to wait a few seconds for the agent to initialize itself.
+
+### Bind Tools
+
+The `story_teller_realtime` use case is preconfigured to use `openai_image_generate_tool`, so you usually don't need to change anything.
diff --git a/docs/ten_agent/playground/run_va.md b/docs/ten_agent/playground/run_va.md
index 2300f6b0ae..f2d8491e41 100644
--- a/docs/ten_agent/playground/run_va.md
+++ b/docs/ten_agent/playground/run_va.md
@@ -2,16 +2,18 @@
 
 This guide will help you to run the Voice Assistant with Large Language Model in the TEN-Agent Playground.
 
-## Prerequisites
+## STT + TTS + LLM
+
+### Prerequisites
 
 - Make sure you have the TEN-Agent playground running. If not, follow the [Run Playground](https://doc.theten.ai/ten-agent/quickstart) guide to start the playground.
 - You will need following information prepared:
   - STT info, any supported STT can be used. [Deepgram](https://deepgram.com/) is relatively easy to register and get started with.
   - TTS info, any supported TTS can be used. [Fish.Audio](https://fish.audio/) is relatively easy to register and get started with.
-  - LLM info, any supported LLM can be used. It's recommended to use [OpenAI](https://openai.com)
+  - LLM info, any supported LLM can be used. It's recommended to use [OpenAI](https://openai.com).
   - RTC info, currently only Agora RTC is supported. You can register your account at [Agora](https://www.agora.io/). We assume you have your App ID and App Certificate ready when you configure your `.env` file.
 
-## Steps
+### Steps
 
 1. Open the playground at [localhost:3000](http://localhost:3000) to configure your agent.
 2. Select the graph type `voice_assistant`.
@@ -25,11 +27,11 @@ This guide will help you to run the Voice Assistant with Large Language Model in
 10. If you see the success toast, the properties are successfully applied to the Large Language Model.
 11. You are all set! Now you can start speaking to the Voice Assistant by clicking on the `Connect` button. Note you will need to wait a few seconds for agent to initialzie itself.
 
-## Using Azure STT
+### Using Azure STT
 
 Azure STT is integrated within RTC extension module. That's why if you want to use Azure STT, you will need to select `voice_assistant_integrated_stt` graph type.
 
-## Bind Weather Tool to your LLM
+### Bind Weather Tool to your LLM
 
 You can bind weather tool to your LLM module in the TEN-Agent Playground.
 It's recommended to use OpenAI LLM below.
@@ -40,3 +42,38 @@ It's recommended to use OpenAI LLM below.
 4. Click on `Save Change` to apply the tool to the LLM module.
 5. If you see the success toast, the tool is successfully applied to the LLM module.
 6. You are all set! Now you can ask the agent about the weather by speaking to it.
+
+
+## Realtime V2V
+
+
+### Prerequisites
+
+- Make sure you have the TEN-Agent playground running. If not, follow the [Run Playground](https://doc.theten.ai/ten-agent/quickstart) guide to start the playground.
+- You will need the following information prepared:
+  - Realtime API Key
+- RTC info. Currently only Agora RTC is supported. You can register your account at [Agora](https://www.agora.io/). We assume you have your App ID and App Certificate ready when you configure your `.env` file.
+
+### Steps
+
+1. Open the playground at [localhost:3000](http://localhost:3000) to configure your agent.
+2. Select the graph type `voice_assistant_realtime`.
+3. Click on `Module Picker` to open the module selection.
+4. Select your preferred V2V module from the dropdown list.
+5. Click on `Save Change` to apply the module to the graph.
+6. Click on the button to the right of the graph selection to open the property configuration. You will see a list of properties that can be configured for the selected V2V module.
+7. Configure the `Realtime API Key` property with the information you prepared.
+8. Click on `Save Change` to apply the property to the V2V module.
+9. If you see the success toast, the property has been successfully applied to the V2V module.
+10. You are all set! Now you can start speaking to the Voice Assistant by clicking on the `Connect` button. Note that you will need to wait a few seconds for the agent to initialize itself.
+
+### Bind Weather Tool to your V2V
+
+You can bind the weather tool to your V2V module in the TEN-Agent Playground.
+
+1. When you have your agent running, open Module Picker.
+2. Click on the button to the right of the V2V module to open the tool selection.
+3. Select `Weather Tool` from the popover list.
+4. Click on `Save Change` to apply the tool to the V2V module.
+5. If you see the success toast, the tool has been successfully applied to the V2V module.
+6. You are all set! Now you can ask the agent about the weather by speaking to it.
diff --git a/docs/ten_agent/playground/run_va_realtime.md b/docs/ten_agent/playground/run_va_realtime.md
deleted file mode 100644
index 90b95b70b9..0000000000
--- a/docs/ten_agent/playground/run_va_realtime.md
+++ /dev/null
@@ -1,34 +0,0 @@
-# Run Voice Assistant with Voice to Voice Realtime API
-
-This guide will help you to run the Voice Assistant with Voice to Voice Realtime API in the TEN-Agent Playground.
-
-## Prerequisites
-
-- Make sure you have the TEN-Agent playground running. If not, follow the [Run Playground](https://doc.theten.ai/ten-agent/quickstart) guide to start the playground.
-- You will need following information prepared:
-  - Realtime API Key
-- RTC info, currently only Agora RTC is supported. You can register your account at [Agora](https://www.agora.io/). We assume you have your App ID and App Certificate ready when you configure your `.env` file.
-
-## Steps
-
-1. Open the playground at [localhost:3000](http://localhost:3000) to configure your agent.
-2. Select the graph type `voice_assistant_realtime`.
-3. Click on `Module Picker` to open the module selection.
-4. Select your preferred V2V module from the dropdown list.
-5. Click on `Save Change` to apply the module to the graph.
-6. Click on the Button to the right of the graph selection to open the property configuration. You will see a list of properties that can be configured for the selected V2V module.
-7. Configure the `Realtime API Key` property with the information you prepared.
-8. Click on `Save Change` to apply the property to the V2V module.
-9. If you see the success toast, the property is successfully applied to the V2V module.
-10. You are all set! Now you can start speaking to the Voice Assistant by clicking on the `Connect` button. Note you will need to wait a few seconds for agent to initialzie itself.
-
-## Bind Weather Tool to your V2V
-
-You can bind weather tool to your V2V module in the TEN-Agent Playground.
-
-1. When you have your agent running. Open Module Picker.
-2. Click on the button to the right of the V2V module to open the tool selection.
-3. Select `Weather Tool` from the popover list.
-4. Click on `Save Change` to apply the tool to the V2V module.
-5. If you see the success toast, the tool is successfully applied to the V2V module.
-6. You are all set! Now you can ask the agent about the weather by speaking to it.