Work done with my advisor, Mirella Lapata!
Preprint: arxiv.org/pdf/2505.14627
#EMNLP2025 #multimodallearning #scalableoversight #visionlanguagemodels #nlproc
Work done with my advisor, Mirella Lapata!
Preprint: arxiv.org/pdf/2505.14627
#EMNLP2025 #multimodallearning #scalableoversight #visionlanguagemodels #nlproc
Our analysis show that judges benefit when the experts are arguing for diverse opinions!
Red quadrant is when the judge is persuaded more often than they should (i.e. they are deceptive).
Our analysis show that judges benefit when the experts are arguing for diverse opinions!
Red quadrant is when the judge is persuaded more often than they should (i.e. they are deceptive).
Yes! We show that the reasoning data attained from debate in a completely unsupervised manner imbue reasoning in the expert vision language models.
Yes! We show that the reasoning data attained from debate in a completely unsupervised manner imbue reasoning in the expert vision language models.