Lightnews — Scholar-powered news

Christina

@christinabaek.bsky.social

110 followers 35 following 1 posts

PhD at CMU / robust ML

Posts Replies Media Videos

Christina

@christinabaek.bsky.social

I’m imagining a simpler setup where words are each a single token long and examples are each a random list of 15 words. If pretrained models already encode the notion of offensive, I bet one iteration of DPO with the right hyperparameter can solve this task.

November 22, 2024 at 7:41 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news