✅ This phenomenon is generalizable across models (Llama3, GPT-4o) & tasks (math, commonsense, verbal).
✅ This phenomenon is generalizable across models (Llama3, GPT-4o) & tasks (math, commonsense, verbal).
📄 arxiv.org/pdf/2502.11364
Multilingual LLMs like Llama3.1/Qwen2.5 have shown English-rivalling performance on high-resource languages, while they often significantly underperform on low-resource languages.
#NLP #Multilingual #LLM
📄 arxiv.org/pdf/2502.11364
Multilingual LLMs like Llama3.1/Qwen2.5 have shown English-rivalling performance on high-resource languages, while they often significantly underperform on low-resource languages.
#NLP #Multilingual #LLM