Desmond Elliott
delliott.bsky.social
Desmond Elliott
@delliott.bsky.social
Reposted by Desmond Elliott
📄 [ACL 2025 main] LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks (doi.org/10.48550/arX...)
LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
There is an increasing trend towards evaluating NLP models with LLMs instead of human judgments, raising questions about the validity of these evaluations, as well as their reproducibility in the case...
doi.org
July 18, 2025 at 10:19 AM
The participants brought a lot of energy, enthusiasm, and great posters to highlight their research: @antoniakrm.bsky.social and @saravera.bsky.social pictured.

Finally, I want to think the Danish Data Science Academy, Carlsberg Foundation, and the Villum Foundation for supporting the event!
June 23, 2025 at 3:13 PM
No, we didn’t record anything but there was an excellent live-poster!
June 20, 2025 at 5:41 PM
Your workshop is so popular that someone managing the door on a one-in one-out basis.
June 11, 2025 at 6:52 PM
Thanks for sharing! I'm looking forward to reading this because I enjoyed reading your lecture notes on Natural Language Understanding with Distributed Representation back in the day.
May 8, 2025 at 2:17 PM