riccardofosco.bsky.social
@riccardofosco.bsky.social
Sound effects, audio & video | PhD at @ISPAMM, @Sapienza | Former @C4DM, @QMUL
Reposted
Yet another pre-Christmas release!! 🎅🎄
Here is 𝐒𝐭𝐚𝐛𝐥𝐞-𝐕𝟐𝐀, which generates sound effects from silent video frames with semantic and temporal alignment.
🎶🥁🎛️

Huge thanks to @riccardofosco.bsky.social, Christian Marinoni, and all co-authors 🤟
December 23, 2024 at 2:01 PM
Reposted
Super interesting work on #GenAI #Video2Audio, with impressive results from my friends @riccardofosco.bsky.social and Christian Marinoni, together with @emilianpos.bsky.social, @mcomunita.bsky.social, Luca Cosmo, Joshua Reiss, and @dacom.bsky.social!

👇 Go check it out!
December 20, 2024 at 6:37 PM
🌟 Excited to Share Our Latest Work! 🎥🎶

Here we present Stable-V2A: Synthesis of Synchronized Sound Effects with Temporal and Semantic Controls

arXiv: arxiv.org/abs/2412.15023
Video presentation and results: ispamm.github.io/Stable-V2A
Stable-V2A: Synchronized Sound Effects Synthesis
Stable-V2A is a two-stage model for synthesizing synchronized sound effects with support for temporal and semantic controls.
ispamm.github.io
December 20, 2024 at 6:19 PM