Mario Sanz
msanz.bsky.social
Mario Sanz
@msanz.bsky.social
PhD student in #NLProc
🧐 Evaluating your LLM with multiple-choice question answering?

🧵 A tiny space in the prompt can make accuracy jump by 11% – and even reshuffle model rankings.

#EMNLP2025 #NLP #AI #LLM #Evaluation
September 26, 2025 at 9:18 AM