👩🏻‍💻 https://agromanou.github.io/
@negarforoutan.bsky.social, Anna Sotnikova, @eric-zemingchen.bsky.social, Sree Harsha Nelaturu, Shivalika Singh, Rishabh Maheshwary, Micol Altomare, Mohamed A Haggag, Imanol Schlag, @mziizm.bsky.social, @sarahooker.bsky.social, @abosselut.bsky.social
INCLUDE-base: up to 550 samples per language, totaling ~23K questions
🤗 : huggingface.co/datasets/Coh...
INCLUDE-lite: up to 250 samples per language, totaling ~11K questions
🤗 : huggingface.co/datasets/Coh...
🌎 Models can struggle with non-English instructions, entangling knowledge evaluation with other factors such as task formatting.
📚 Models have a long way to go in capturing the regional knowledge reflected in languages.
💪 Model scale improves regional knowledge understanding, but other techniques like CoT or instruction tuning have minimal or negative impacts.
Users expect #LLMs to know information relevant to their environments: customs, culture, etc.
To be relevant & relatable, LLMs need to know these nuances. It's not just global knowledge; it's about meeting user needs where they are.
Regional knowledge is the local info, culture & practices of a regional context. US Law is a great topic, but it is less relevant for multilingual LLMs serving other regions.
For INCLUDE, we collect regional knowledge rather than translating Western-centric benchmarks.