🧵 A tiny space in the prompt can make accuracy jump by 11% – and even reshuffle model rankings.
#EMNLP2025 #NLP #AI #LLM #Evaluation
🧵 A tiny space in the prompt can make accuracy jump by 11% – and even reshuffle model rankings.
#EMNLP2025 #NLP #AI #LLM #Evaluation