Xinpeng Wang
xinpeng.bsky.social
PhD student @LMU. Eval & LLM Alignment.
https://xinpeng-wang.github.io/
I’m thrilled to share that our paper on mitigating false refusal in language models has been accepted to ICLR 2025 @iclr-conf.bsky.social!

arxiv.org/abs/2410.03415

Joint work with chengzhi, @paul-rottger.bsky.social, @barbaraplank.bsky.social.
January 23, 2025 at 9:34 PM