@marwahalaofi.com @iroldie.bsky.social
www.damianospina.com/publication/...
@marwahalaofi.com @iroldie.bsky.social
www.damianospina.com/publication/...
theconversation.com/what-makes-a...
@admscentre.org.au
@rmitcomputing.bsky.social
@umbrellacorpn.bsky.social
theconversation.com/what-makes-a...
@admscentre.org.au
@rmitcomputing.bsky.social
@umbrellacorpn.bsky.social
We present an LLM-based pipeline that boosts relevance assessment accuracy through modular classification.
#SIGIR2025
We present an LLM-based pipeline that boosts relevance assessment accuracy through modular classification.
#SIGIR2025
aclanthology.org/2024.finding...
Why I personally think it works better, mainly because it's hard to calibrate a pointwise relevance prediction, but a pairwise prediction hardly needs calibration.
Here I talk about using Qwen 2.5 locally as a local pairwise search evaluator
softwaredoug.com/blog/2025/01...
sigir2025.dei.unipd.it/keynote-spea...
sigir2025.dei.unipd.it/keynote-spea...
Evaluation Perspectives of #RecSys. Edited by @christinebauer.bsky.social, @evazangerle.bsky.social, and myself. Written by a whole host of fantastic #recsys people. Too many to mention (pic here www.dagstuhl.de/en/seminars/...)
drops.dagstuhl.de/entities/doc...
Evaluation Perspectives of #RecSys. Edited by @christinebauer.bsky.social, @evazangerle.bsky.social, and myself. Written by a whole host of fantastic #recsys people. Too many to mention (pic here www.dagstuhl.de/en/seminars/...)
drops.dagstuhl.de/entities/doc...