(Basque Country)
Researcher at orainlp.bsky.social (PhD)
#NLP: pretraining LMs & low-resource & tokenization
🇵🇲 🇪🇸 🏴 🔜 🇫🇷 🇳🇴
toka/el/he
🍉
This is how San Mamés bid farewell to the Palestinian national football team.
#ofizialtasuna #FreePalestine
https://b.eus/81af64...
50,000 fans are expected to attend a historic game on Saturday, November 15, in Bilbao.
By El País in English
english.elpais.com/sports/2025-...
🎧 Listen⤵️
🔗 eitb.eus/N_9VjAzv/
And request access❗️👉 at kimu.orai.eus
🗞️🔗 zientzia.eus/artikul...
Researchers studied more than 1,000 speakers of 29 languages to see how they use demonstratives—words that show where something is in relation to the person talking (“this cat”, “that dog”).
#MRL2025
💡 Sub-1B Language Models for #Low_Resource #Languages: Training Strategies and Insights for #Basque
#LLMs
🔗 aclanthology.org/2025.mrl-mai...
Come check it out!
📝 aclanthology.org/2025.mrl-mai...
We trained 3 models - 1.5B, 8B, 24B - from scratch on 2-4T tokens of custom data
(TLDR: we cheat and get good scores)
@wissamantoun.bsky.social @rachelbawden.bsky.social @bensagot.bsky.social @zehavoc.bsky.social
https://b.eus/2760d4...
📺As this platform admitted to ‘The Intercept’, the removal of the accounts and videos is a direct consequence of the sanctions imposed by the US.
elsal.to/44908
The Spanish government opens 2 migrant jails in Mauritania. The construction work was carried out by the cooperation agency FIAP (Ministry of Foreign Affairs). Both detention centers have cribs for babies.
Via @elsaltodiario.com
🧵 THREAD ⬇️
www.elsaltodiario.com/fronteras/go...
👏👏Good luck, Gorka! You did an excellent job!
ℹ️Thesis supervisors: Xabier Saralegi Urizar of @orainlp.bsky.social and Aitor Soroa Etxabe of @hitz-zentroa.bsky.social
#AA #LLM
LLMs learn from vastly more data than humans ever experience. BabyLM challenges this paradigm by focusing on developmentally plausible data.
We extend this effort to 45 new languages!