Varvara Arzt
@kleines-gespenst.bsky.social
Doing PhD in LM interpretability at TU Wien. Views are mine ☮️
Check out Global PIQA for over 100 languages! Happy to have contributed to the project along with @sarahsu.bsky.social, @allanhanbury.bsky.social & Terra Blevins by putting Albanian on the map ✨
Introducing Global PIQA, a new multilingual benchmark for 100+ languages. This benchmark is the outcome of this year’s MRL shared task, in collaboration with 300+ researchers from 65 countries. This dataset evaluates physical commonsense reasoning in culturally relevant contexts.
October 29, 2025 at 9:49 PM
Relation Extraction or Pattern Matching? How well do RE models generalise to OOD data? We find that higher in-distribution scores do not necessarily translate to better transferability.

Paper: arxiv.org/abs/2505.12533
May 20, 2025 at 1:47 PM