TurkuNLP
banner
turkunlp.bsky.social
TurkuNLP
@turkunlp.bsky.social
The TurkuNLP Group is an interdisciplinary group of researchers at the University of Turku and the UTU graduate school (UTUGS). We do research on various aspects of natural language processing, language technology and digital linguistics.
TurkuNLP member Risto Luukkonen's MSc thesis has been selected as one of three contenders for the best AI thesis of the year by AI Finland! 🎉The winner will be announced in the AI Gala next week. aifinland.fi/ai-gaala-202...
AI-gaala 2025 finalistit julki – tässä ovat Suomen tekoälykentän kärkinimet ja käyttötapaukset - AI Finland
17.11. järjestettävä AI-gaala tuo yhteen 500 yritysjohtajaa, asiantuntijaa ja innovaattoria juhlistamaan tekoälyn merkittäviä saavutuksia ja tulevaisuuden mahdollisuuksia Suomessa. Kansainvälisen AI S...
aifinland.fi
November 12, 2025 at 12:23 PM
Reposted by TurkuNLP
FAIR Science Café @csc.fi is an interactive online event where researchers present their work highlighting data used or produced. On Nov 21. Tomasz Galica @turkunlp.bsky.social talks about developing and evaluating LLMs, training datasets and risks. Info & sign up: www.dariah.fi/event/fair-s...
FAIR Science Café
FAIR Science Café is a informal and interactive online event where researchers get to talk about their work, research, and results in their own words. The discussion also highlights the data used, …
www.dariah.fi
November 12, 2025 at 9:35 AM
Our experts contributed to the latest #HPLT dataset publication, which contains some very interesting results! See here: t.co/uN2zoSF251 #DataScience
November 6, 2025 at 2:47 PM
Suomen Akatemia valitsi uudeksi huippuyksiköksi Virpi Lummaan johtaman ihmisen monimuotoisuutta tutkivan yksikön (2026-2033), jossa ovat mukana Veronika Laippala, Päivi Onkamo ja Outi Vesakoski. Tutkimuksen huippuyksiköt kuuluvat oman tieteenalansa kansainväliseen kärkeen. www.utu.fi/fi/ajankohta...
Turun yliopistoon uusi Virpi Lummaan johtama huippuyksikkö
Suomen Akatemia valitsi uudeksi huippuyksiköksi Virpi Lummaan johtaman yksikön, jossa ovat mukana Veronika Laippala, Päivi Onkamo ja Outi Vesakoski.
www.utu.fi
October 31, 2025 at 8:19 AM
Doctoral students from TurkuNLP together with people from DigiTS Tartu are planning a workshop on presentation skills specifically for DH researchers! We are grateful for #TurkuUniversityFoundation for the Villa Tammekann grant for hosting the upcoming workshop next autumn. ♥️ Looking forward to it!
October 27, 2025 at 12:26 PM
(Nojonen, Korsu, Ginter, Laippala & Kanerva 2025) introduce TCBLex, a lexical database of Finnish literary works read by children (7-15y). Data consists of 14 sub-lexicons and over 11 million tokens, annotated and lemmatized.
Paper: link.springer.com/article/10.3...
Data: doi.org/10.5281/zeno...
TCBLex - A lexical database of Finnish literary texts for children - Behavior Research Methods
This work introduces TCBLex, a lexical database of Finnish literary works read by children between the ages of 7 and 15. We explain in detail the work done to build the corpus TCBLex is based on, incl...
link.springer.com
October 20, 2025 at 8:48 AM
Two articles by TurkuNLP members have been published in a book about the linguistic landscape of Turku, except that (Kupari & Lamberg 2025) and (Ristilä 2025) have turned the tables and observed the "landscape in language". The book is available for free online here: oa.finlit.fi/books/e/10.2...
October 13, 2025 at 7:09 AM
Our Doctoral Researcher Otto Tarkka (@ottotarkka.bsky.social) visited CSC facilities in Kajaani last month on a trip organized by FIN-CLARIAH. "It was great to meet new people and hear how CSC computers are used in a wide variety of research projects."
October 13, 2025 at 6:52 AM
Our Latin expert, Hanna-Mari Kupari, presented at the Norwegian Institute in Rome on "Latin Across Registers: A Computational Analysis of Situational Language Use Reflected in Grammar". See the slides and abstract here:
github.com/HannaKoo/Nor...
September 29, 2025 at 10:32 AM
Teimouri, Kanerva & Ginter (2025) published insights for model interpretability in their study of a multi-attention head model, showing that heads capture distinct semantics and deeper layers enhance separation but pooling can blur patterns: acl-bg.org/proceedings/...
acl-bg.org
September 29, 2025 at 7:21 AM
Maryam from TurkuNLP participated in #RANLP2025 (Recent advances in Natural Language Processing) and their team won a competition where they were to create a solution for a hate speech classifier for 5 low resource languages. 🏆Congrats!
September 15, 2025 at 12:31 PM
Reposted by TurkuNLP
Miksi Eurooppa tarvitsee omia kielimalleja tekoälyn aikakaudella, tutkija Sampo Pyysalo?
Miksi Eurooppa tarvitsee omia kielimalleja tekoälyn aikakaudella, tutkija Sampo Pyysalo?
suomenkuvalehti.fi
April 27, 2025 at 9:41 AM
Reposted by TurkuNLP
Mitkä ovat tutkijoidemme suurimmat toiveet ja pahimmat pelot tekoälyn suhteen?

🎧 Kuuntele Tiedelinja-podcastimme uusin jakso, jossa data-analytiikan professori Filip Ginter ja vararehtori Tapio Salakoski keskustelevat tekoälystä.

👉 Kuuntele Tiedelinja-podcastia: www.utu.fi/fi/ajankohta...
January 24, 2025 at 8:43 AM
TurkuNLP was at Corpus Linguistics Conference 2025! #CL2025 Pictures of some of our participants by Hanna-Mari Kupari and Jiaqi Guo. Search the book of abstracts for "University of Turku" to read more about our contributions: drive.google.com/file/d/1TiwO... Thank you @cl2025.co.uk!
July 8, 2025 at 9:19 AM
TurkuNLP leads the central work package on building LLMs within OpenEuroLLM.
openeurollm.eu/blog/LUMI-Ex...
OpenEuroLLM
A series of foundation models for transparent AI in Europe
openeurollm.eu
May 30, 2025 at 6:59 AM
Our recent paper on the impact of register (genre) on LLM performance. Key points: news do poor in evaluation, while opinionated texts are among the best. We hope this work can be used to understand the impact of register on LLMs and improve training data mixes! arxiv.org/abs/2504.01542
April 15, 2025 at 12:57 PM
TurkuNLP is now on Bluesky! 🎉
April 15, 2025 at 11:31 AM