Katy Felkner
katyfelkner.bsky.social
Katy Felkner
@katyfelkner.bsky.social
(she/her) | #NLProc PhD student | NSF GRFP | LLM bias evalutation | community-engaged ethical AI | advocating for women and LGBTQ+ in STEM | okie living in LA
📢 New Preprint! 📢
arxiv.org/abs/2510.07662

TL;DR: textual entailment and token probability behave very differently as bias evaluation metrics, even on the exact same bias definitions.

Also, I'm looking for summer 2026 research internships in responsible AI - please reach out!
Textual Entailment and Token Probability as Bias Evaluation Metrics
Measurement of social bias in language models is typically by token probability (TP) metrics, which are broadly applicable but have been criticized for their distance from real-world langugage model u...
arxiv.org
October 10, 2025 at 10:15 PM