www.justinsalamon.com
arXiv (ICML'25): arxiv.org/abs/2505.053...
demos: flam-model.github.io
Led by Yusong Wu, with @tsirif.bsky.social Ke Chen, Cheng-Zhi Anna Huang, Aaron Courville, @urinieto.bsky.social @pseeth.bsky.social
arXiv (ICML'25): arxiv.org/abs/2505.053...
demos: flam-model.github.io
Led by Yusong Wu, with @tsirif.bsky.social Ke Chen, Cheng-Zhi Anna Huang, Aaron Courville, @urinieto.bsky.social @pseeth.bsky.social
w/ @urinieto.bsky.social @pseeth.bsky.social
w/ @urinieto.bsky.social @pseeth.bsky.social
The audio model was built by our team, the Sound Design AI (SODA) group at Adobe Research w/ @pseeth.bsky.social and @urinieto.bsky.social 🙌
www.youtube.com/watch?v=_Bv5...
The audio model was built by our team, the Sound Design AI (SODA) group at Adobe Research w/ @pseeth.bsky.social and @urinieto.bsky.social 🙌
www.youtube.com/watch?v=_Bv5...
Amazing job @hugofloresgarcia.bsky.social @pseeth.bsky.social @urinieto.bsky.social
I should've done my hair...
www.instagram.com/reel/DEEBRhd...
Amazing job @hugofloresgarcia.bsky.social @pseeth.bsky.social @urinieto.bsky.social
I should've done my hair...
www.instagram.com/reel/DEEBRhd...
www.technologyreview.com/2024/12/18/1...
www.technologyreview.com/2024/12/18/1...
Big changes are coming for WASPAA 2025!
Big changes are coming for WASPAA 2025!
Takes a text prompt + vocal (or sonic) imitation and generates sound effects that perfectly match the energy and dynamics of your voice.
It's an extremely intuitive (and fun!) way to create SFX that are perfectly timed to your video.
Led by @hugofloresgarcia.bsky.social 👏
Sketch2Sound can create sounds from sonic imitations (i.e., a vocal imitation or a reference sound) via interpretable, time-varying control signals.
paper: arxiv.org/abs/2412.08550
web: hugofloresgarcia.art/sketch2sound
Takes a text prompt + vocal (or sonic) imitation and generates sound effects that perfectly match the energy and dynamics of your voice.
It's an extremely intuitive (and fun!) way to create SFX that are perfectly timed to your video.
Led by @hugofloresgarcia.bsky.social 👏
Full details in the link in the post below, DM me if interested.
The Sound Design AI Group (SODA) is looking for an exceptional research engineer to join us in building the future of AI-assisted audio and video creation.
Strong ML background, GenAI experience a plus.
Details: adobe.wd5.myworkdayjobs.com/external_exp...
Full details in the link in the post below, DM me if interested.
The Sound Design AI Group (SODA) is looking for an exceptional research engineer to join us in building the future of AI-assisted audio and video creation.
Strong ML background, GenAI experience a plus.
Details: adobe.wd5.myworkdayjobs.com/external_exp...
The Sound Design AI Group (SODA) is looking for an exceptional research engineer to join us in building the future of AI-assisted audio and video creation.
Strong ML background, GenAI experience a plus.
Details: adobe.wd5.myworkdayjobs.com/external_exp...
MultiFoley generates perfectly synced audio for video at full 48 kHz and supports multimodal conditioning.
You can define the generated sound via a text prompt, an example SFX, or audio you want to extend.
Led by our intern @czyang.bsky.social 👇
We can
⌨️Make a typewriter sound like a piano 🎹
🐱Make a cat meow like a lion roars! 🦁
⏱️Perfectly time existing SFX 💥 to a video.
arXiv: arxiv.org/abs/2411.17698
website: ificl.github.io/MultiFoley/
MultiFoley generates perfectly synced audio for video at full 48 kHz and supports multimodal conditioning.
You can define the generated sound via a text prompt, an example SFX, or audio you want to extend.
Led by our intern @czyang.bsky.social 👇