Pieter Delobelle
banner
pieter.ai
Pieter Delobelle
@pieter.ai
LLM engineer at Aleph Alpha | 👨‍💻 Fairness in LLMs and Dutch NLP | Prev. apple, PhD & postdoc from KU Leuven

pieter.ai
So while I believe our use for tweety (and even my RobBERT model trained in 2019) is well within the law, it is a worrying precedent set by Brein.

geitje’s blog post here: goingdutch.ai/en/posts/gei...
The end of GEITje 1
At the pressing request of Stichting BREIN, GEITje is no longer available as of today. All model files have been removed from my HuggingFace repositories1. GEITje was a Dutch-language large open langu...
goingdutch.ai
January 30, 2025 at 12:47 PM
.. instead of uni-backed Dutch LLMs like Fietje-2b by @bramvanroy.bsky.social (KUL) or our tweety-7b-dutch (KUL & UGent).

How copyright applies to LLMs is not so clearcut (it protects works from unauthorised distribution), since LLMs do not repeat training data unless severely oversampled.
January 30, 2025 at 12:47 PM
Not super multilingual, but for Dutch, German, French and English (all Belgian languages 🇧🇪) there is is this variant: huggingface.co/Parallia/Fai...
Parallia/Fairly-Multilingual-ModernBERT-Embed-BE · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
January 10, 2025 at 2:38 PM
Reposted by Pieter Delobelle
TweetyIta and ItaEval are a language model and evaluation benchmark for Italian tasks. What's more, they are 100% community-driven and born within RiTA (rita-nlp.org). @asantilli.bsky.social will present the poster on Dec 5, 16:30-17:30.

+ Pieter Delobelle, Moreno La Quatra, @bsavoldi.bsky.social
December 4, 2024 at 2:44 PM