xiamengzhou.bsky.social
@xiamengzhou.bsky.social
Reposted
SimPO: new method from Princeton PLI for improving chat models via preference data. Simpler than DPO and widely adopted within weeks by top models in the chatbot arena. Excellent and elementary account by author
@xiamengzhou.bsky.social (she's also on job market!). tinyurl.com/pepcynaxFully
tinyurl.com
December 3, 2024 at 2:55 PM