(1) ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing
(2) Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation
🔍 More at researchtrend.ai/communities/VGen
code: github.com/FunAudioLLM...
demo: huggingface.co/spaces/FunA...
code: github.com/FunAudioLLM...
demo: huggingface.co/spaces/FunA...
there is a new spiritual successor to mmaudio called ThinkSound that supports chain-of-thought prompts for extremely accurate video-to-audio generation
kinda blown away:
there is a new spiritual successor to mmaudio called ThinkSound that supports chain-of-thought prompts for extremely accurate video-to-audio generation
kinda blown away:
Export mp4 et partage url.
Gratuit, open source et illimité.
#uneIAparjour #IA #audio #vidéo
Export mp4 et partage url.
Gratuit, open source et illimité.
#uneIAparjour #IA #audio #vidéo
Paper: arxiv.org/abs/2506.21448
Paper: arxiv.org/abs/2506.21448
Github: github.com/FunAudioLLM/...
Github: github.com/FunAudioLLM/...
Project: thinksound-project.github.io
Movie Gen + ThinkSound
Project: thinksound-project.github.io
Movie Gen + ThinkSound
Unlike commercial giants, ThinkSound brings chain-of-thought reasoning to multimodal AI, letting it generate audio that’s not just synced, but semantically spot-on.
VEO3 + ThinkSound 🧵1/5
Unlike commercial giants, ThinkSound brings chain-of-thought reasoning to multimodal AI, letting it generate audio that’s not just synced, but semantically spot-on.
VEO3 + ThinkSound 🧵1/5
aidisruption.ai/p/alibaba-op...
aidisruption.ai/p/alibaba-op...
TON of capable grads in BC, ON. Tax incentives, too! www.investcanada.ca/programs-inc...
TON of capable grads in BC, ON. Tax incentives, too! www.investcanada.ca/programs-inc...