[논문] Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
https://arxiv.org/abs/2106.06103 Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-SpeechSeveral recent end-to-end text-to-speech (TTS) models enabling single-stage training and parallel sampling have been proposed, but their sample quality does not match that of two-stage TTS systems. In this work, we present a parallel end-to-end TTS methodarxiv.org해당 논문을 보고 ..
연구실 공부
2024. 5. 23.