Ondrej Dusek
tuetschek.bsky.social
Ondrej Dusek
@tuetschek.bsky.social
Teaching computers to talk at Charles University. (Computational) linguistics, politics, climate, public transit. He/him.
Reposted by Ondrej Dusek
🔤 Pretraining Language Models with LoRA and Artificial Languages
Nalin Kumar, Mateusz Lango, @tuetschek.bsky.social t
aclanthology.org/2025.babylm-...
Constructed artificial languages with LoRA affects language model development.
Pretraining Language Models with LoRA and Artificial Languages
Nalin Kumar, Mateusz Lango, Ondrej Dusek. Proceedings of the First BabyLM Workshop. 2025.
aclanthology.org
November 11, 2025 at 2:37 PM
Reposted by Ondrej Dusek
🎓 You are an LLM teaching a smaller model everything you know: Multi-task pretraining of language models with LLM-designed study plans
Wiktor Kamzela, Mateusz Lango, @tuetschek.bsky.social
aclanthology.org/2025.babylm-...
You are an LLM teaching a smaller model everything you know: Multi-task pretraining of language models with LLM-designed study plans
Wiktor Kamzela, Mateusz Lango, Ondrej Dusek. Proceedings of the First BabyLM Workshop. 2025.
aclanthology.org
November 11, 2025 at 2:37 PM
Reposted by Ondrej Dusek
📚 SRS-Stories: Vocabulary-constrained multilingual story generation for language learning
Wiktor Kamzela, Mateusz Lango & @toonietuesday.bsky.social
aclanthology.org/2025.emnlp-i...
LLM stories teach vocab while reviewing learned words via Spaced Repetition-more grammatical than standard generation
November 7, 2025 at 8:54 PM
Reposted by Ondrej Dusek
🤖 LLM Agents Implement an NLG System from Scratch
Mateusz Lango, Ondrej Dusek
aclanthology.org/2025.emnlp-i...
LLM agents can autonomously build interpretable, rule-based RDF-to-text generators from scratch, combining the LLMs with the transparency and reliability of traditional rule-based systems.
LLM Agents Implement an NLG System from Scratch: Building Interpretable Rule-Based RDF-to-Text Generators
Mateusz Lango, Ondrej Dusek. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track. 2025.
aclanthology.org
November 7, 2025 at 8:54 PM
Reposted by Ondrej Dusek
👥 Can Large Language Models Personalize Dialogues to Generational Styles?
P. Balestrucci, @tuetschek.bsky.social, L. Anselma, A. Mazzei
aclanthology.org/2025.finding...
Can LLMs adapt dialogues to generational styles? We show with P-MultiWoZ that models capture patterns from Boomers to Gen Z.
Can Large Language Models Personalize Dialogues to Generational Styles?
Pier Felice Balestrucci, Ondrej Dusek, Luca Anselma, Alessandro Mazzei. Findings of the Association for Computational Linguistics: EMNLP 2025. 2025.
aclanthology.org
November 7, 2025 at 8:54 PM
Reposted by Ondrej Dusek
📊 Real-World Summarization: When Evaluation Reaches Its Limits
@patuchen.bsky.social , @tuetschek.bsky.social , @saad.me.uk
aclanthology.org/2025.finding...
For hotel highlights, metrics like word overlap surprisingly match human judgments better than complex methods. LLMs unreliable as evaluators.
Real-World Summarization: When Evaluation Reaches Its Limits
Patrícia Schmidtová, Ondrej Dusek, Saad Mahamood. Findings of the Association for Computational Linguistics: EMNLP 2025. 2025.
aclanthology.org
November 7, 2025 at 8:54 PM
It's fine by me if they generate it, as long as it works and they know how... but I've been getting loads of roughly plausible but non-functional code, with hallucinated API calls etc. 😒. Not that many emojis though (in docs only).
August 3, 2025 at 4:22 PM
Reposted by Ondrej Dusek
FreshTab: Sourcing Fresh Resources for Table-to-Text Generation Evaluation
by @navitas.bsky.social, ‪@oplatek.bsky.social‬, ‪@zdenekkasner.bsky.social‬, @tuetschek.bsky.social .bsky.social‬
July 31, 2025 at 1:30 PM
Reposted by Ondrej Dusek
ReproHum #0669-08: Reproducing Sentiment Transfer Evaluation
by @navitas.bsky.social, M. Lango, @patuchen.bsky.social, @tuetschek.bsky.social
Challenge to reproduce human evaluations from NLP papers, testing the reproducibility of evaluation studies
July 31, 2025 at 1:30 PM
Reposted by Ondrej Dusek
OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs
by @ivankartac.bsky.social, M. Lango, @tuetschek.bsky.social
arxiv.org/abs/2503.11858
Open-source NLG evaluation metric that explains errors and matches human judgments without proprietary models
July 31, 2025 at 1:30 PM
Slides and links to papers at bit.ly/mlprague25-od 🤓
Ondrej Dusek MLPrague 2025
Evaluating LLM outputs with humans and LLMs Ondřej Dušek MLPrague 30 April 2025 These slides: https://bit.ly/mlprague25-od
bit.ly
May 2, 2025 at 7:25 PM