Speed difference for longer input text #296

Open
Ananya21162 opened this issue Dec 9, 2024 · 3 comments

@Ananya21162

We are noticing very slow speech for short sentences, while for longer sentences the model starts at a normal pace and then gradually speeds up to a noticeably high rate, which often sounds unnatural.
What could be the possible cause for this? Can anyone please help!

@UmerrAhsan

Latency generally increases as the length of the input sentence grows. However, a slowdown for short sentences is not typical and might indicate an issue. I've worked with StyleTTS2 and successfully reduced its latency by 2.5-3 times. If you can share your model file, I can investigate further to pinpoint the issue.

One possible reason for unnatural output is that StyleTTS2 is trained on audiobook datasets, where the style is tailored toward narration. This makes it perform well for longer sentences but struggle with shorter text, leading to degraded quality. Additionally, the model is trained with a high maximum sequence length, which could also explain the inconsistency when dealing with shorter inputs.

@Ananya21162
Author

Thank you so much for your response.
I trained the model on LibriTTS plus 50 hours of additional audio, with max sequence length = 512.
For a very short input like "Slide 1", the output is very slow.
For a very long input like "The Supplier Accounts Receivable Specialist ensures the accurate submission of supplier invoices by verifying all required details, such as purchase order references and amounts, before uploading them into the system.", the output is relatively fast.
I am not sure what the reason could be. Is there something we can do while training the model?

@UmerrAhsan

Hi @Ananya21162,

Without seeing the code, I can't say much, but what I would suggest is performing an inner ablation study. Print the time taken by each component during inference—the text encoder, BERT, alignment, prosody predictor, decoder, diffusion, and any other relevant components. That way you can identify which specific component is causing the issue, which will help pinpoint the problem. Then let me know, and we can debug it further.
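A minimal sketch of that timing ablation. The stage names and the `fake_stage` placeholder below are illustrative stand-ins, not StyleTTS2's actual API—replace them with the real calls from your inference script:

```python
import time
from contextlib import contextmanager

# Accumulated per-stage wall-clock times, in seconds.
timings = {}

@contextmanager
def timed(name):
    """Accumulate wall-clock time for one named inference stage."""
    start = time.perf_counter()
    try:
        yield
    finally:
        timings[name] = timings.get(name, 0.0) + time.perf_counter() - start

# Placeholder for a real model component (hypothetical, for illustration).
def fake_stage():
    time.sleep(0.001)

# Wrap each inference stage; in your script these would be the actual
# text encoder, BERT, alignment, prosody predictor, diffusion, and
# decoder calls.
for stage in ["text_encoder", "bert", "alignment",
              "prosody_predictor", "diffusion", "decoder"]:
    with timed(stage):
        fake_stage()

# Report the slowest stages first.
for name, seconds in sorted(timings.items(), key=lambda kv: -kv[1]):
    print(f"{name:18s} {seconds * 1000:7.2f} ms")
```

Running this once for a short input ("Slide 1") and once for a long sentence, then comparing the two tables, should show which stage dominates the short-input latency.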
