Afra Amini
afraamn.bsky.social
Ph.D. Student @ ETH Zürich
Current KL estimation practices in RLHF can suffer from high variance and can even yield negative estimates! We propose a provably better estimator that takes only a few lines of code to implement. 🧵👇
w/ @xtimv.bsky.social and Ryan Cotterell
paper: arxiv.org/pdf/2504.10637
code: github.com/rycolab/kl-rb
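To make the claim concrete, here is a minimal sketch, not the authors' code, comparing the usual single-sample Monte Carlo estimate of KL(p‖q) with a Rao-Blackwellized variant that sums the exact per-step KL between next-token distributions along a sampled sequence. The toy "models" (`make_model`, `next_dist`), the vocabulary size, and the sequence length are all assumptions made up for illustration; see the linked paper and repo for the actual estimator.

```python
import math
import random

VOCAB = 3   # toy vocabulary size (assumption for illustration)
LENGTH = 4  # fixed sequence length (assumption for illustration)

def make_model(seed):
    """Hypothetical toy 'language model': one next-token distribution
    per previous token, plus one extra row for the empty prefix."""
    rng = random.Random(seed)
    table = []
    for _ in range(VOCAB + 1):
        weights = [rng.random() + 0.1 for _ in range(VOCAB)]
        total = sum(weights)
        table.append([w / total for w in weights])
    return table

def next_dist(model, prefix):
    """Next-token distribution given the prefix (depends on last token only)."""
    return model[prefix[-1] if prefix else VOCAB]

def token_kl(p, q):
    """Exact KL between two next-token distributions; always >= 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

def sample_estimates(p_model, q_model, rng):
    """Draw one sequence from p and return (MC estimate, RB estimate)."""
    prefix, mc, rb = [], 0.0, 0.0
    for _ in range(LENGTH):
        p = next_dist(p_model, prefix)
        q = next_dist(q_model, prefix)
        rb += token_kl(p, q)                 # exact expectation over the next token
        tok = rng.choices(range(VOCAB), weights=p)[0]
        mc += math.log(p[tok] / q[tok])      # log-ratio of the sampled token only
        prefix.append(tok)
    return mc, rb

def compare(n=2000, seed=0):
    """Return per-sample MC and RB estimates computed on the same samples."""
    rng = random.Random(seed)
    p_model, q_model = make_model(1), make_model(2)
    pairs = [sample_estimates(p_model, q_model, rng) for _ in range(n)]
    return [a for a, _ in pairs], [b for _, b in pairs]

if __name__ == "__main__":
    mc, rb = compare()
    print("MC mean:", sum(mc) / len(mc), "min:", min(mc))
    print("RB mean:", sum(rb) / len(rb), "min:", min(rb))
```

In this sketch both estimators target the same quantity, but each RB sample is a sum of exact KL terms and so cannot be negative, whereas a single MC sample is a log-ratio that can be.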
May 6, 2025 at 2:59 PM