Realtime TTS #1187
Comments
Unfortunately, this is not possible at this time, since the TTS can only work on the completed text. Perhaps it would feel better if you disabled streaming?
You can break the streamed response into sentences and then run the TTS on each sentence, playing it back to the user. In this case you only have to wait until the first sentence is complete. This is what I do in my speech-to-speech project.
Thank you both for the advice. How do I break the streamed response into sentences?
The pseudocode is
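A minimal Python sketch of the sentence-splitting step (an illustration, not the commenter's original pseudocode). It assumes the streamed response arrives as an iterable of text chunks; `iter_sentences` is a hypothetical helper name, and the rule "`.`, `!` or `?` followed by whitespace ends a sentence" is a simplification that will also split after abbreviations such as "Mr.".

```python
import re
from typing import Iterable, Iterator

# Assumed rule: a sentence ends at ., ! or ? followed by whitespace.
_SENTENCE_END = re.compile(r"[.!?]+\s+")


def iter_sentences(chunks: Iterable[str]) -> Iterator[str]:
    """Yield complete sentences as soon as they appear in a chunk stream."""
    buffer = ""
    for chunk in chunks:
        buffer += chunk
        # A single chunk may complete more than one sentence.
        while True:
            match = _SENTENCE_END.search(buffer)
            if match is None:
                break
            sentence = buffer[:match.end()].strip()
            buffer = buffer[match.end():]
            if sentence:
                yield sentence
    # Whatever is left when the stream ends is the final sentence.
    tail = buffer.strip()
    if tail:
        yield tail
```

Because a sentence is only emitted once the whitespace after its punctuation has arrived, a number like "3.14" split across chunks is not cut in half.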
You would ideally run this in a separate thread and queue the sentences. In another thread, use TTS to generate audio from each sentence and queue that. Finally, in yet another thread, play each audio file.
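The three-thread arrangement described above could be sketched in Python roughly as follows. This is only an illustration under assumptions: `run_pipeline`, `synthesize` and `play` are hypothetical names, with `synthesize` standing in for whatever TTS call you use and `play` for your audio playback. Neither is a Koboldcpp API.

```python
import queue
import threading
from typing import Callable, Iterable

_DONE = object()  # sentinel that marks the end of the stream


def run_pipeline(
    sentences: Iterable[str],
    synthesize: Callable[[str], bytes],  # hypothetical: text -> audio
    play: Callable[[bytes], None],       # hypothetical: plays one clip
) -> None:
    text_q: queue.Queue = queue.Queue()
    audio_q: queue.Queue = queue.Queue()

    def produce() -> None:
        # Thread 1: queue sentences as they become available.
        for sentence in sentences:
            text_q.put(sentence)
        text_q.put(_DONE)

    def tts_worker() -> None:
        # Thread 2: turn each queued sentence into audio.
        while True:
            item = text_q.get()
            if item is _DONE:
                audio_q.put(_DONE)
                break
            audio_q.put(synthesize(item))

    def player() -> None:
        # Thread 3: play clips in order while later ones are generated.
        while True:
            item = audio_q.get()
            if item is _DONE:
                break
            play(item)

    threads = [threading.Thread(target=f) for f in (produce, tts_worker, player)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
```

The queues keep the clips in sentence order, so playback of the first sentence can start while the second is still being synthesized. The sketch does no error handling: if `synthesize` or `play` raises, the other threads will wait forever, so a real version would need to forward the sentinel on failure.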
Thank you
The Koboldcpp app is amazing. The only issue I see is that the TTS starts only after the text is finished, which takes forever. Is there a way to have the TTS run while the text is still being generated, to reduce the delay?