Ashutosh Adhikari
yourstrulyash.bsky.social
Ashutosh Adhikari
@yourstrulyash.bsky.social
PhD student UofEdinurgh.
Reposted by Ashutosh Adhikari
From medicine to geo-guessing, humans can get incredibly good at solving visual recognition tasks.
But how is this skill learned, and can we model its progression?
We present CleverBirds, accepted #NeurIPS2025, a large-scale benchmark for visual knowledge tracing.
📄 arxiv.org/abs/2511.08512
1/5
CleverBirds: A Multiple-Choice Benchmark for Fine-grained Human Knowledge Tracing
Mastering fine-grained visual recognition, essential in many expert domains, can require that specialists undergo years of dedicated training. Modeling the progression of such expertize in humans rema...
arxiv.org
November 12, 2025 at 3:29 PM
Excited to share my first work as a PhD student at EdinburghNLP that I will be presenting at EMNLP!

RQ1: Can we achieve scalable oversight across modalities via debate?

Yes! We show that debating VLMs lead to better model quality of answers for reasoning tasks.
November 1, 2025 at 7:30 PM