📍Prague, CZ. 🔗 http://users.ntua.gr/psomasbill/
📄 arxiv.org/abs/2510.25387
🧪 github.com/billpsomas/i...
George Retsinas, @nikos-efth.bsky.social, Panagiotis Filntisis, Yannis Avrithis, Petros Maragos, Ondrej Chum, @gtolias.bsky.social.
📄 arxiv.org/abs/2510.25387
🧪 github.com/billpsomas/i...
George Retsinas, @nikos-efth.bsky.social, Panagiotis Filntisis, Yannis Avrithis, Petros Maragos, Ondrej Chum, @gtolias.bsky.social.
⚡BASIC: training-free pipeline (centering, projection with PCA, textual contextualization, Harris-style fusion) with strong results across i-CIR and class-level CIR benchmarks.
⚡BASIC: training-free pipeline (centering, projection with PCA, textual contextualization, Harris-style fusion) with strong results across i-CIR and class-level CIR benchmarks.
📊~750K images, 202 instances, ~1,900 composed queries. Despite small per-query DBs (~3.7K images), i-CIR matches the difficulty of searching with >40M random distractors.
📊~750K images, 202 instances, ~1,900 composed queries. Despite small per-query DBs (~3.7K images), i-CIR matches the difficulty of searching with >40M random distractors.
🗂️ Per instance we share a database and define:
- composed positives (same object + modification)
- hard negatives:
- visual (same/similar object, wrong text)
- textual (right text, wrong instance)
- composed (near-miss on both).
🗂️ Per instance we share a database and define:
- composed positives (same object + modification)
- hard negatives:
- visual (same/similar object, wrong text)
- textual (right text, wrong instance)
- composed (near-miss on both).
🔎 Gap in the community: Existing CIR benchmarks are class-level, ambiguous, without explicit hard negatives, and often reward text-only behaviour. We needed a dataset that truly requires both image and text, at the instance level. i-CIR fills that gap.
🔎 Gap in the community: Existing CIR benchmarks are class-level, ambiguous, without explicit hard negatives, and often reward text-only behaviour. We needed a dataset that truly requires both image and text, at the instance level. i-CIR fills that gap.