Lightnews — Scholar-powered news


Alexander Rubinstein

@arubique.bsky.social

1 follower · 5 following · 7 posts

PhD student at the University of Tübingen and IMPRS-IS


Pinned

Alexander Rubinstein @arubique.bsky.social · Oct 10

🪩 Evaluate your LLMs on benchmarks like MMLU at 1% cost.

In our new paper, we show that outputs on a small subset of test samples that maximise diversity in model responses are predictive of the full dataset performance.

Project page: arubique.github.io/disco-site/

More below 🧵👇

October 10, 2025 at 9:42 AM
