www.justinsalamon.com
arxiv.org/abs/2505.053...
Big congratulations to Yusong Yu, @tsirif.bsky.social and the whole team from @adobe.com research, @mit.edu and @mila-quebec.bsky.social
arxiv.org/abs/2505.053...
Big congratulations to Yusong Yu, @tsirif.bsky.social and the whole team from @adobe.com research, @mit.edu and @mila-quebec.bsky.social
The secret sauce: A memory-efficient and calibrated frame-wise objective with logit adjustment to address spurious correlations, such as event dependencies and label imbalances during training
The secret sauce: A memory-efficient and calibrated frame-wise objective with logit adjustment to address spurious correlations, such as event dependencies and label imbalances during training
A model trained to produce a calibrated likelihood for *any* text prompt.
FLAM outperforms prior self-supervised models on both closed-set and open-set SED, while preserving strong retrieval and zero-shot classification accuracy
A model trained to produce a calibrated likelihood for *any* text prompt.
FLAM outperforms prior self-supervised models on both closed-set and open-set SED, while preserving strong retrieval and zero-shot classification accuracy
"So use CLAP", some of you will say.
The problem is its output likelihoods are not calibrated for different prompts :(
That's ok ranked retrieval, but for detection it's a no go.
"So use CLAP", some of you will say.
The problem is its output likelihoods are not calibrated for different prompts :(
That's ok ranked retrieval, but for detection it's a no go.
It has some applications, but it doesn't address general purpose sound search.
It has some applications, but it doesn't address general purpose sound search.
MultiFoley, a Video-to-Audio model that generates perfectly synced audio for video at 48 kHz and supports multimodal conditioning.
More on MultiFoley here: bsky.app/profile/czya...
MultiFoley, a Video-to-Audio model that generates perfectly synced audio for video at 48 kHz and supports multimodal conditioning.
More on MultiFoley here: bsky.app/profile/czya...