arxiv.org/abs/2510.07662
TL;DR: textual entailment and token probability behave very differently as bias evaluation metrics, even on the exact same bias definitions.
Also, I'm looking for summer 2026 research internships in responsible AI - please reach out!