Skip to content

Latest commit

 

History

History
66 lines (45 loc) · 2.4 KB

README.md

File metadata and controls

66 lines (45 loc) · 2.4 KB

AI-Vtuber

This code is designed to read chat messages from YouTube and then utilize OpenAI's GPT-3 language model to generate responses. The output from GPT-3 is then read out loud using a TTS (Text-to-Speech) engine provided by ElevenLabs.

Setup

Install dependencies

git clone https://github.com/Koischizo/AI-Vtuber/
cd AI-Vtuber
pip install -r requirements.txt

It also requires ffmpeg to be installed

Usage

Edit the variables EL_key and OAI_key in config.json

EL_key is the API key for ElevenLabs. Found in Profile Settings

OAI_key is the API key for OpenAI. Found here

Then run run.py

Default TTS

python run.py -id STREAMID 

Elevenlabs TTS

python run.py -id STREAMID -tts EL 

then you're set

Notes

Replace STREAMID with the stream's ID that you can find on the Youtube Stream link

You can change the voice by changing voice in config.json. You can find the ID's here in Get Voices

Live Demo

Livestream 1

Livestream 2

Other

I used This VTS plugin and VB Audio cable to make her mouth move and be able to play music at the same time

Please note that this project was created solely for fun and as part of a YouTube video, so the quality and reliability of the code may be questionable. Also, after the completion of the project checklist, there won't be much activity in updating or improving this repository. Nonetheless, we hope that this project can serve as a source of inspiration for anyone interested in building their own AI Vtuber.

  • Clean up
  • GUI
  • Executables (exe, bat or sh)
  • Extra features (maybe) (Prompt injection protection, questions only mode, virtual audio)

License

This program is under the MIT license