Formerly Senior Staff Scientist @GoogleDeepMind, PhD @NYU, @Polytechnique
If this is of interest to you please reach out!
This time we enabled the distillation of large multimodal models into much smaller ones, simply by choosing the data they learn from.
Sets a new state of the art in small multimodal models that are very efficient for inference!
arxiv.org/abs/2411.18674
Smol models are all the rage these days & knowledge distillation (KD) is key for model compression!
We show how data curation can effectively distill to yield SoTA FLOP-efficient {C/Sig}LIPs!!
🧵👇
👉Language-image pretraining with CLIP or SigLIP is widely used due to strong zero-shot transfer, but ....
Fun project with @confusezius.bsky.social, @zeynepakata.bsky.social, @dimadamen.bsky.social and
@olivierhenaff.bsky.social.
Turns out you can, and here is how: arxiv.org/abs/2411.15099
Really excited to share this work on multimodal pretraining for my first Bluesky entry!
🧵 A short and hopefully informative thread:
We find simple changes to multimodal pretraining are sufficient to yield outsized gains on a wide range of few-shot tasks.
Congratulations @confusezius.bsky.social on a very successful internship!