Lightnews — Scholar-powered news

Ruchira Dhar

@eclecticruchira.bsky.social

PhD Fellow in AI Evals @UniCopenhagen.
Interested in AI Policy / AI Ethics / Responsible AI.
Community Lead @cohereforai.bsky.social
Site: ruchiradhar.github.io
#nlproc #llm #ai

Ruchira Dhar

@eclecticruchira.bsky.social

💡 What we propose:

EvalCards to report model evaluations. They’re designed to be:
✅ Easy to write
✅ Easy to understand
✅ Hard to miss
Each card summarizes capabilities, safety tests, metrics, prompts & key notes. Here’s a sample for an OLMo model from @allen_ai!

November 13, 2025 at 4:08 PM

Ruchira Dhar

@eclecticruchira.bsky.social

🌟 New Paper Alert: EvalCards 🌟

Excited to share that our new paper, “EvalCards: A Framework for Standardized Evaluation Reporting”, has been accepted for presentation at the @EurIPSConf workshop on "The Science of Benchmarking and Evaluating AI".

November 13, 2025 at 4:08 PM
