Lightnews — Scholar-powered news

Pradeep Dasigi

@pdasigi.bsky.social

370 followers 82 following 13 posts

#NLP research @ai2.bsky.social; OLMo post-training
https://pdasigi.github.io/

Posts Replies Media Videos

Pradeep Dasigi

@pdasigi.bsky.social

For each "core skill" we care about, we chose a separate set of "development" and "unseen" evaluations. We tracked the performance of models only on the former during development and evaluated only the final checkpoints on the unseen ones.

November 23, 2024 at 11:53 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news