Mainly focused on fine-grained image representations that scale.
Curious how well the latest models can recognize particular objects?
We evaluated the base and large variants of DINOv3 and Perception Encoder (PE) on instance-level image retrieval.
See the results 👉 vrg.fel.cvut.cz/ilias/
Curious how well the latest models can recognize particular objects?
We evaluated the base and large variants of DINOv3 and Perception Encoder (PE) on instance-level image retrieval.
See the results 👉 vrg.fel.cvut.cz/ilias/
We will release code, models and datasets within next 2 weeks.
We are also working on a search demo for the proposed datasets with user prompts!
I hope to see you all in Honolulu!
Excited to announce our new work: "Large-scale Pre-training for Grounded Video Caption Generation" with Cordelia Schmid & @josef-sivic.bsky.social.
Paper: arxiv.org/abs/2503.10781
Project: ekazakos.github.io/grounded_vid...
Code (coming soon): github.com/ekazakos/grove 1/7
We will release code, models and datasets within next 2 weeks.
We are also working on a search demo for the proposed datasets with user prompts!
I hope to see you all in Honolulu!
Instance-Level Recognition and Generation (ILR+G) Workshop at ICCV2025 @iccv.bsky.social
📅 new deadline: June 26, 2025 (23:59 AoE)
📄 paper submission: cmt3.research.microsoft.com/ILRnG2025
🌐 ILR+G website: ilr-workshop.github.io/ICCVW2025/
#ICCV2025 #ComputerVision #AI
Instance-Level Recognition and Generation (ILR+G) Workshop at ICCV2025 @iccv.bsky.social
📅 new deadline: June 26, 2025 (23:59 AoE)
📄 paper submission: cmt3.research.microsoft.com/ILRnG2025
🌐 ILR+G website: ilr-workshop.github.io/ICCVW2025/
#ICCV2025 #ComputerVision #AI
cmp.felk.cvut.cz/colloquium/#...
cmp.felk.cvut.cz/colloquium/#...
LPOSS is a training-free method for open-vocabulary semantic segmentation using Vision-Language Models.
LPOSS is a training-free method for open-vocabulary semantic segmentation using Vision-Language Models.
@gkordo.bsky.social, Vladan Stojnić @annetka.bsky.social Pavel Šuma, Nikolaos-Antonios Ypsilantis @nikos-efth.bsky.social Zakaria Laskar,Jiří Matas, Ondřej Chum, @gtolias.bsky.social
tl;dr: SigLIP rules. Lots of ablations
arxiv.org/abs/2502.11748
1/
@gkordo.bsky.social, Vladan Stojnić @annetka.bsky.social Pavel Šuma, Nikolaos-Antonios Ypsilantis @nikos-efth.bsky.social Zakaria Laskar,Jiří Matas, Ondřej Chum, @gtolias.bsky.social
tl;dr: SigLIP rules. Lots of ablations
arxiv.org/abs/2502.11748
1/
#Internship #CV
#Internship #CV