Ivan Kartáč
ivankartac.bsky.social
Ivan Kartáč
@ivankartac.bsky.social
PhD student @ Charles University. Researching evaluation and explainability of reasoning in language models.
Our paper "OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs" has been accepted to #INLG2025 conference!

You can read the preprint here: arxiv.org/abs/2503.11858
August 23, 2025 at 4:36 PM
Reposted by Ivan Kartáč
#ACL2025NLP in Vienna 🇦🇹 starts today with 23 🤯 @ufal-cuni.bsky.social folks presenting their work both at the main conference and workshops. Check out our main conference papers today and on Wednesday 👇
July 28, 2025 at 7:27 AM
Reposted by Ivan Kartáč
Slides and links to papers at bit.ly/mlprague25-od 🤓
Ondrej Dusek MLPrague 2025
Evaluating LLM outputs with humans and LLMs Ondřej Dušek MLPrague 30 April 2025 These slides: https://bit.ly/mlprague25-od
bit.ly
May 2, 2025 at 7:25 PM
Reposted by Ivan Kartáč
Today, @tuetschek.bsky.social shared the work of his team on evaluating LLM text generation with both human annotation frameworks and LLM-based metrics. Their approach tackles the benchmark data leakage problem and how to get unseen data for unbiased LLM testing.
April 30, 2025 at 12:02 PM
Reposted by Ivan Kartáč
How do LLMs compare to human crowdworkers in annotating text spans? 🧑🤖

And how can span annotation help us with evaluating texts?

Find out in our new paper: llm-span-annotators.github.io

Arxiv: arxiv.org/abs/2504.08697
Large Language Models as Span Annotators
Website for the paper Large Language Models as Span Annotators
llm-span-annotators.github.io
April 15, 2025 at 11:10 AM