Eran Hirsch
eranhirsch.bsky.social
Eran Hirsch
@eranhirsch.bsky.social
PhD candidate @biunlp ; Tweets about NLP, ML and research ; https://eranhirs.github.io/
Reposted by Eran Hirsch
New preprint! ✨
Interested in LLM-as-a-Judge?
Want to get the best judge for ranking your system?
our new work is just for you:
"JuStRank: Benchmarking LLM Judges for System Ranking"
🕺💃
arxiv.org/abs/2412.09569
JuStRank: Benchmarking LLM Judges for System Ranking
Given the rapid progress of generative AI, there is a pressing need to systematically compare and choose between the numerous models and configurations available. The scale and versatility of such eva...
arxiv.org
December 13, 2024 at 10:16 AM