[Paper] FastSpeech: Fast, Robust and Controllable Text to Speech
https://arxiv.org/abs/1905.09263

This post was written after reading the paper linked above.

Abstract: Neural network based end-to-end text to speech (TTS) has significantly improved the quality of synthesized speech. Prominent methods (e.g., Tacotron 2) usually first generate mel-spectrogram from text, and then synthesize speech from the mel-spectrogram us..
Lab Study
2024. 4. 5.