Here is 𝐒𝐭𝐚𝐛𝐥𝐞-𝐕𝟐𝐀, which generates sound effects from silent video frames that are semantically and temporally aligned with the video.
🎶🥁🎛️
Huge thanks to @riccardofosco.bsky.social, Christian Marinoni, and all co-authors 🤟
👇 Go check it out!
Here we present Stable-V2A: Synthesis of Synchronized Sound Effects with Temporal and Semantic Controls
arXiv: arxiv.org/abs/2412.15023
Video presentation and results: ispamm.github.io/Stable-V2A