https://xinpeng-wang.github.io/
We propose a surgical & flexible approach to mitigate false refusal in LLMs with minimal effect on performance and inference cost
led by @xinpeng.bsky.social (1/2)
We propose a surgical & flexible approach to mitigate false refusal in LLMs with minimal effect on performance and inference cost
led by @xinpeng.bsky.social (1/2)
arxiv.org/abs/2410.03415
Joint work with chengzhi, @paul-rottger.bsky.social, @barbaraplank.bsky.social.
arxiv.org/abs/2410.03415
Joint work with chengzhi, @paul-rottger.bsky.social, @barbaraplank.bsky.social.