[논문] Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
https://arxiv.org/abs/1803.09017 Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech SynthesisIn this work, we propose "global style tokens" (GSTs), a bank of embeddings that are jointly trained within Tacotron, a state-of-the-art end-to-end speech synthesis system. The embeddings are trained with no explicit labels, yet learn to model a large rangarxiv.org해당 논문을..
연구실 공부
2024. 6. 16.