Lightnews — Scholar-powered news

Sebastian Joseph

@sebajoe.bsky.social

11 followers 3 following 6 posts

CS Ph.D. Student at UT Austin

Posts Replies Media Videos

Sebastian Joseph

@sebajoe.bsky.social

How good are LLMs at 🔭 scientific computing and visualization 🔭?

AstroVisBench tests how well LLMs implement scientific workflows in astronomy and visualize results.

SOTA models like Gemini 2.5 Pro & Claude 4 Opus only match ground truth scientific utility 16% of the time. 🧵

June 2, 2025 at 3:42 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news