Lightnews — Scholar-powered news

Serina Chang

@serinachang5.bsky.social

Incoming Assistant Professor at UC Berkeley in CS and Computational Precision Health. Postdoc at Microsoft Research, PhD in CS at Stanford. Research in AI, graphs, public health, and computational social science.

https://serinachang5.github.io/

Posts Replies Media Videos

Serina Chang

@serinachang5.bsky.social

Blog post on exciting research happening at MSR, including our recent work ChatBench on human-AI vs AI-alone evaluation!

Microsoft Research @msftresearch.bsky.social · Apr 23

In this issue: our CHI 2025 & ICLR 2025 contributions, plus research on causal reasoning & LLMs; countering LLM jailbreak attacks; and how people use AI vs. AI-alone. Also, SVP of Microsoft Health Jim Weinstein talks rural healthcare innovation: msft.it/6013SHuu1

April 25, 2025 at 12:05 AM

Serina Chang

@serinachang5.bsky.social

Looking forward to taking part in this CHI'25 panel organized by @angelhwang.bsky.social !!

Angel Hsing-Chi Hwang @angelhwang.bsky.social · Apr 22

📣 Calling all #CHI2025 attendees who work with human participants: Join our panel discussion on #LLM, #simulation, #syntheticdata, and the future of human subjects research on Apr 30 (Wed), 2:10 - 3:40 PM (JP Time)

Post your questions for panelists here: forms.gle/m2mXY3xFafAX...

April 22, 2025 at 6:55 PM

Serina Chang

@serinachang5.bsky.social

1st post on bsky!

What happens when a static benchmark comes to life? ✨ Introducing ChatBench, a large-scale user study where we *converted* MMLU questions into thousands of user-AI conversations. Then, we trained a user simulator on ChatBench to generate user-AI outcomes on unseen questions. 1/ 🧵

April 11, 2025 at 5:57 PM

Reposted by Serina Chang

jake hofman

@jakehofman.bsky.social

Check out ChatBench, our new paper+dataset. We turned AI benchmarks into user-AI chats and show that AI-alone evals often fail to predict how real humans perform with AI.
@serinachang5.bsky.social @ashtonanderson.bsky.social

serinachang5.github.io/assets/files...
huggingface.co/datasets/mic...

serinachang5.github.io

April 9, 2025 at 9:29 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news