Lightnews — Scholar-powered news

Katy Felkner

@katyfelkner.bsky.social

📢 New Preprint! 📢
arxiv.org/abs/2510.07662

TL;DR: textual entailment and token probability behave very differently as bias evaluation metrics, even on the exact same bias definitions.

Also, I'm looking for summer 2026 research internships in responsible AI - please reach out!

Textual Entailment and Token Probability as Bias Evaluation Metrics

Measurement of social bias in language models is typically by token probability (TP) metrics, which are broadly applicable but have been criticized for their distance from real-world language model u...

arxiv.org

October 10, 2025 at 10:15 PM
