Skip to content

smilingDima/sova-tts

 
 

Repository files navigation

SOVA TTS

SOVA TTS is a speech syntthesis solution based on Tacotron 2 architecture. It is designed as a REST API service and it can be customized (both code and models) for your needs.

Installation

The easiest way to deploy the service is via docker-compose, so you have to install Docker and docker-compose first. Here's a brief instruction for Ubuntu:

Build and deploy

  • Clone the repository, download the pretrained models archive and extract the contents into the project folder:
git clone --recursive https://github.com/smilingDima/sova-tts.git
cd sova-tts/
wget http://dataset.sova.ai/SOVA-TTS/Data_v1.1.tar
tar -xvf Data_v1.1.tar && rm Data_v1.1.tar
  • Build docker image

    • Build sova-tts image if you're planning on using CPU:
    sudo docker-compose build sova-tts
  • Run the desired service container

    • CPU:
    sudo docker-compose up -d sova-tts

Testing

To test the service you can send a POST request:

curl --request POST 'http://localhost:8899/synthesize/' \
--header 'Content-Type: application/json' \
--data-raw '{
    "text": "Добрый день! Как ваши дел+а?",
    "voice": "Natasha"
}'

Acknowledgements

Original Tacotron 2 implementation by NVIDIA.

About

sova-tts for RedOS

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 65.1%
  • CSS 17.2%
  • HTML 10.3%
  • JavaScript 7.4%