[논문] What Do I Hear? Generating Sounds for Visual with ChatGPT
https://arxiv.org/abs/2311.05609 What Do I Hear? Generating Sounds for Visuals with ChatGPTThis short paper introduces a workflow for generating realistic soundscapes for visual media. In contrast to prior work, which primarily focus on matching sounds for on-screen visuals, our approach extends to suggesting sounds that may not be immediately varxiv.org해당 논문을 보고 작성했습니다. Introduction이 논문은 visual..
연구실 공부
2024. 6. 19.