Amr Keleg
amr-keleg.bsky.social
Amr Keleg
@amr-keleg.bsky.social
PhD student at the University of Edinburgh.
Co-creator of AlWird (Arabic Wordle).

Research interests: Diversity of Arabic Dialects, Arabic NLP, Multilinguality.

https://amr-keleg.github.io/
According to openreview, our paper got accepted to #ACL2025NLP 🥳🥳🥳
Further details to be shared soon!
May 14, 2025 at 10:07 PM
I share some preliminary thoughts for four steps that could help in building culturally representative models.

Lastly, I hope this will spark discussions within the Arabic NLP community, and the broader NLP community interested in serving marginalized speech communities!

(4/4)
March 21, 2025 at 6:56 PM
* The NLP community acknowledges the rich diversity of the Arabic dialects, which are a manifestation of cultural differences across the region.

* While Arabic-specific LLMs are still marketed as serving all Arabs, our alignment data/benchmarks are scarce and not inclusive enough!

(3/4)
March 21, 2025 at 6:56 PM
* Arabic speakers have substantial cultural similarities (see map below). This does not imply they have one single homogenous culture!

* Their views tend to be ignored, even for largely diverse alignment datasets (e.g., PRISM, Kirk et al., 2024).

(2/4)
March 21, 2025 at 6:55 PM
I share some preliminary thoughts for four steps that could help in building culturally representative models.

Lastly, I hope this will spark discussions within the Arabic NLP community, and the broader NLP community interested in serving marginalized speech communities!

(4/4)
March 21, 2025 at 6:53 PM
* The NLP community acknowledges the rich diversity of the Arabic dialects, which are a manifestation of cultural differences across the region.

* While Arabic-specific LLMs are still marketed as serving all Arabs, our alignment data/benchmarks are scarce and not inclusive enough!

(3/4)
March 21, 2025 at 6:53 PM
* Arabic speakers have substantial cultural similarities (see map below). This does not imply they have one single homogenous culture!

* Their views tend to be ignored, even for largely diverse alignment datasets (e.g., PRISM, Kirk et al., 2024).

(2/4)
March 21, 2025 at 6:53 PM