#NLProc #Multimodal
arXiv: arxiv.org/abs/2405.02793
#NLProc #ComputerVision #Multimodal
Now integrating thinking capabilities, 2.5 Pro Experimental is our most performant Gemini model yet. It’s #1 on the LM Arena leaderboard. 🥇
Now integrating thinking capabilities, 2.5 Pro Experimental is our most performant Gemini model yet. It’s #1 on the LM Arena leaderboard. 🥇
🖼️ Image Descriptions to improve Image-Text alignment
AND/OR
💬Multi/Cross Lingual image-text understanding/generation
AND/OR
🌏Geo-Cultural representation and learning
Please DM if you are willing to discuss the current state/challenges/future-work.
🖼️ Image Descriptions to improve Image-Text alignment
AND/OR
💬Multi/Cross Lingual image-text understanding/generation
AND/OR
🌏Geo-Cultural representation and learning
Please DM if you are willing to discuss the current state/challenges/future-work.
we work on image, video, audio, etc… come work with us if you’re interested! apply asap :)
we work on image, video, audio, etc… come work with us if you’re interested! apply asap :)
arXiv: arxiv.org/abs/2405.02793
#NLProc #ComputerVision #Multimodal
arXiv: arxiv.org/abs/2405.02793
#NLProc #ComputerVision #Multimodal