这是「数字生命」服务部分代码。包括与前端通信,语音识别,chatGPT接入和语音合成。
For other part of the project, please refer to:
Launcher 启动此服务器的图形界面。
UE Client 用于渲染人物动画,录音,和播放声音的前端部分。
详细的配置流程可参见readme_detail.md
git clone https://github.com/zixiiu/Digital_Life_Server.git --recursive
-
install pytorch
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
-
install other requirements
pip install -r requirements.txt
-
Build
monotonic_align
This may not work that well but you know what that suppose to mean.cd "TTS/vits/monotonic_align" mkdir monotonic_align python setup.py build_ext --inplace
Windows系统的话, 执行
cp monotonic_align/*.pyd .
Linux系统的话, 执行
cp monotonic_align/*.so .
-
Download models
百度网盘
HuggingFaceASR Model:
to/ASR/resources/models
Sentiment Model:
to/SentimentEngine/models
TTS Model:
to/TTS/models
-
(对于没有Nvidia显卡的电脑,采用cpu来跑的话)需要额外做一步:
将 Digital_Life_Server\TTS\TTService.py 文件下 36行
self.net_g = SynthesizerTrn(...).cuda() 修改为 self.net_g = SynthesizerTrn(...).cpu()
到这里,项目构建完毕🥰
run-gpt3.5-api.bat