Lightnews — Scholar-powered news

Ashutosh Adhikari

@yourstrulyash.bsky.social

9 followers 34 following 5 posts

PhD student UofEdinurgh.

Posts Replies Media Videos

Ashutosh Adhikari

@yourstrulyash.bsky.social

RQ3: Where do debate or consultancy fail?

Our analysis show that judges benefit when the experts are arguing for diverse opinions!

Red quadrant is when the judge is persuaded more often than they should (i.e. they are deceptive).

November 1, 2025 at 7:30 PM

Ashutosh Adhikari

@yourstrulyash.bsky.social

RQ2: Can debate be used as a reliable mechanism for yielding quality reasoning data?

Yes! We show that the reasoning data attained from debate in a completely unsupervised manner imbue reasoning in the expert vision language models.

November 1, 2025 at 7:30 PM

Ashutosh Adhikari

@yourstrulyash.bsky.social

Excited to share my first work as a PhD student at EdinburghNLP that I will be presenting at EMNLP!

RQ1: Can we achieve scalable oversight across modalities via debate?

Yes! We show that debating VLMs lead to better model quality of answers for reasoning tasks.

November 1, 2025 at 7:30 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news