Mike Zhang
banner
mjjzha.bsky.social
Mike Zhang
@mjjzha.bsky.social
Postdoc — Aalborg University (CPH) 🇩🇰

#NLPxEducation #NLPxHR #NLP

Past:
🇩🇰 IT University of Copenhagen
🇨🇭 Swiss Federal Institute of Technology Lausanne
🇸🇬 National University of Singapore
🇩🇪 NEC
🇳🇱 University of Groningen

🌐 https://jjzha.github.io/
Reposted by Mike Zhang
Are you attending NAACL 2025 and are you interested in low-resource languages and dialects?

Then don't miss our very own @verenablaschke.bsky.social's keynote talk at the WNUT 2025 workshop on May 3rd:

Beyond “noisy” text: How (and why) to process dialect data

🌐 ☀️
noisy-text.github.io/2025/
April 15, 2025 at 9:49 PM
Reposted by Mike Zhang
🚀 We are excited to introduce Kaleidoscope, the largest culturally-authentic exam benchmark.

📌 Most VLM benchmarks are English-centric or rely on translations—missing linguistic & cultural nuance. Kaleidoscope expands in-language multilingual 🌎 & multimodal 👀 VLMs evaluation
April 10, 2025 at 8:24 PM
Reposted by Mike Zhang
NoDaLiDa x Baltic-HLT 2025 is a wrap!

Thank you all for joining for a fruitful conference! Safe trip home and see you in Copenhagen or Vilnius in 2027!!

#nlp #nodalida #baltichlt
March 5, 2025 at 3:11 PM
Reposted by Mike Zhang
NoDaLiDa 2027 will be held at the Center of Language Technology at the University of Copenhagen!!

#nodalida #nlp
March 4, 2025 at 3:23 PM
Reposted by Mike Zhang
Welcome to NoDaLiDa / Baltic-HLT 2025!

After the opening speech (9:00), we're kicking of with the opening keynote by

Prof. @arianna-bis.bsky.social (09:20-10:10): "Not all Language Models need to be Large: Studying Language Evolution and Acquisition with Modern Neural Networks". (in Lääne-Euroopa)
March 3, 2025 at 6:21 AM
Reposted by Mike Zhang
The first day of workshops is almost a wrap! Join us later today at the welcome reception at the Institute of the Estonian Language (maps.app.goo.gl/brzig4jP6ZfZ...) from 18:30 onwards!!

#nlp #nodalida #baltichlt
Google Maps
Find local businesses, view maps and get driving directions in Google Maps.
maps.app.goo.gl
March 2, 2025 at 3:15 PM
Reposted by Mike Zhang
Now conference-approved!

#nlp #nlproc #nodalida #baltichlt #tallinn
March 2, 2025 at 2:03 PM
Reposted by Mike Zhang
Morning!!! We're excited to welcome you in Tallinn!

On March 2nd (Sunday), we're starting with workshops in the Hestia Hotel Europa:

RESOURCEFUL 2025 (9:00-17:00): shorturl.at/HypPv

NB-REAL 2025 (9:00-13:00): nbreal.xyz

NLP4Ecology 2025 (13:30-17:30): nlp4ecology2025.di.unito.it

#nlp #nlproc
March 2, 2025 at 6:23 AM
Reposted by Mike Zhang
Heading to Tallinn for @nodalida.bsky.social! 🇪🇪 We’re presenting our work on:

🇩🇰 Sun, Mar 2: "DaKultur: Evaluating the Cultural Awareness of LLMs for Danish“ (10:30)
🤖 Tue, Mar 4: "SnakModel: Lessons from Training Our Open Danish LLM" (10:45)

Finally, networking over some lovely Estonian soup! 🍲
March 1, 2025 at 9:57 PM
Reposted by Mike Zhang
The Lisbon Machine Learning Summer School (LxMLS) 2025 is now open for applications.

I’ve done the Covid version myself, and I found the content to be very useful. I went to Lisbon on another occasion, which is also a huge recommendation!!

lxmls.it.pt/2025/
LxMLS 2025 - The 15th Lisbon Machine Learning Summer School
lxmls.it.pt
February 27, 2025 at 4:10 PM
The technical report is now out, loads of interesting insights into multilingual LLM pre-training ✍️

arxiv.org/abs/2502.12982

Congrats Longxu, Qian, Fan, Changyu and team!

#nlp
February 27, 2025 at 4:04 PM
This work has now been accepted at #CVPR2025 🤘
(Reposting)

🚀 Introducing All Languages Matter (ALM-Bench): A diverse multilingual and multimodal VQA benchmark spanning 100 languages with 22.7K Q&A pairs. Covering 19 generic and culture-specific domains, ALM-Bench features 4 diverse question types to advance inclusivity in LMMs. 🌍
February 27, 2025 at 4:01 PM
Reposted by Mike Zhang
🚀 Thank you all for waiting! The full program of NoDaLiDa x Baltic-HLT is online:

www.nodalida-bhlt2025.eu/program

#nodalida #baltichlt #nlp #nlproc
NoDaLiDa/Baltic-HLT 2025 - Program
All times are local (GMT+2/UTC+2). See detailed program below.
www.nodalida-bhlt2025.eu
February 18, 2025 at 3:27 PM
Reposted by Mike Zhang
NoDaLiDa/Baltic-HLT is in less than two weeks!

Some of the places you can visit:

Kalamaja, one of Tallinn's oldest districts, is known for its wooden houses and hipster vibe, with Telliskivi Creative City as its cultural hub. Nearby, Noblessner offers waterfront views and diverse dining options.
February 17, 2025 at 1:27 PM
Reposted by Mike Zhang
Looking for a PhD student to come work with me on the ethical implications of NLP from September!

Please share widely and point any interesting students my way! 😊
We are excited to announce a new, funded PhD studentship opportunity, alongside the School of Informatics @edinburgh-uni.bsky.social!

Supervised by @zeerak.bsky.social and starting Sept 2025, the project will examine the ethical implications of natural language processing.

Apply ▶️ edin.ac/40PAXEq
February 11, 2025 at 2:57 PM
The NLP group at Aalborg University (*Copenhagen campus*) is hiring a postdoc in NLP Security:

(ddl: 15 April)

www.vacancies.aau.dk/scientific-p...

#nlproc #nlp #aau
POSTDOC IN NATURAL LANGUAGE PROCESSING SECURITY (NLPSec) (2025-224-06224)
The postdoc will be working with the Natural Language Processing(NLP) team in the Data, Knowledge, and Web Engineering(...
www.vacancies.aau.dk
February 10, 2025 at 8:50 AM
Reposted by Mike Zhang
Otherwise there is also the Pierre Chocolaterie in the hidden Masters’ Courtyard, or a number of other trendy establishments. When it comes to views, you can’t beat those from the Old Town Wall, its towers and Toompea Hill’s viewing platforms!!

See you soon!

#nodalida #baltichlt #nlp #nlproc
February 10, 2025 at 8:42 AM
Reposted by Mike Zhang
NoDaLiDa/Baltic-HLT is less than a month away!

Did you know Talllinn is a living UNESCO treasure and also has a café culture? One such example is Maiasmokk, the oldest café in Tallinn dating back to 1864!
February 10, 2025 at 8:42 AM
Hi folks, in collaboration with @cohereforai.bsky.social, we're looking for contributors to a Multilingual **Multimodal** Exams benchmark in MCQ style. What's in it for you:

Submit 1000Qs for high/mid-resource, or for 500 low-resource langs to be eligible for co-authorship.
January 22, 2025 at 12:33 PM
Reposted by Mike Zhang
9.6 million seconds = 1 PhD 🔥

Finally analyzed my PhD time tracking data so you can plan your own research journey more effectively: mxij.me/x/phd-learning-dynamics

For current students: I hope this helps put your journey into perspective. Wishing you all the best!
The Learning Dynamics of a PhD
This is what a PhD looks like: 9.6 million seconds of research.
mxij.me
December 23, 2024 at 10:08 PM
Reposted by Mike Zhang
Nice to see @mjjzha.bsky.social presenting our joint collaboration with Aallborg University: SnakModel, a new language model for Danish 🇩🇰
(w/ @mxijme.bsky.social @elisabassignana.bsky.social and Rob van der Goot)
Very happy to have learned about my old @nlpnorth.bsky.social colleague @mjjzha.bsky.social et al.'s work on SnakModel, a new language model for Danish!
December 19, 2024 at 11:01 AM
Thanks for the warm invite @dnnslmr.bsky.social !!!
Very happy to have learned about my old @nlpnorth.bsky.social colleague @mjjzha.bsky.social et al.'s work on SnakModel, a new language model for Danish!
December 19, 2024 at 12:33 PM
Reposted by Mike Zhang
Very happy to have learned about my old @nlpnorth.bsky.social colleague @mjjzha.bsky.social et al.'s work on SnakModel, a new language model for Danish!
December 19, 2024 at 10:20 AM