[논문] One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization
https://arxiv.org/abs/1904.05742 One-shot Voice Conversion by Separating Speaker and Content Representations with Instance NormalizationRecently, voice conversion (VC) without parallel data has been successfully adapted to multi-target scenario in which a single model is trained to convert the input voice to many different speakers. However, such model suffers from the limitation that it carxiv...
연구실 공부
2024. 5. 12.