radoslawregula/VoxG

VoxG: Singing voice synthesizer using GAN.

This repository contains an implementation of a Wasserstein GAN (WGAN) for singing voice synthesis in TensorFlow 2.5. The neural networks were trained on the publicly available NUS-48E Sung and Spoken Lyrics Corpus.

The code is based on the original implementation described in "WGANSing: A Multi-Voice Singing Voice Synthesizer Based on the Wasserstein-GAN", with improvements to the training process and code quality. The WGAN implementation can be reused for other similar tasks.
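For readers new to the Wasserstein GAN, the sketch below shows the generic objective in plain Python. It is not code from this repository: the function names `critic_loss`, `generator_loss`, and `clip_weights` are hypothetical, the scores are plain lists of floats standing in for critic outputs, and the clipping threshold of 0.01 is only an illustrative default. The loss actually used here may include additional terms.

```python
from statistics import fmean


def critic_loss(real_scores, fake_scores):
    """Generic WGAN critic loss (hypothetical helper, not from this repo).

    The critic is trained to score real samples higher than generated ones,
    so it minimizes mean(fake) - mean(real).
    """
    return fmean(fake_scores) - fmean(real_scores)


def generator_loss(fake_scores):
    """Generic WGAN generator loss: push critic scores on fakes upward."""
    return -fmean(fake_scores)


def clip_weights(weights, clip_value=0.01):
    """Clamp each weight to [-clip_value, clip_value].

    Weight clipping is one common way to keep the critic roughly
    Lipschitz-bounded; clip_value here is an illustrative assumption.
    """
    return [max(-clip_value, min(clip_value, w)) for w in weights]


if __name__ == "__main__":
    real = [1.0, 3.0]   # made-up critic scores on real samples
    fake = [0.0, 1.0]   # made-up critic scores on generated samples
    print(critic_loss(real, fake))
    print(generator_loss(fake))
    print(clip_weights([0.5, -0.5, 0.005]))
```

In a TensorFlow training loop the same quantities would be computed on tensors and minimized with an optimizer; the plain-Python form is only meant to make the sign conventions easy to follow.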

This project was completed as a Master's thesis in Computer Science at Adam Mickiewicz University, Poznań, Poland.