Lightnews — Scholar-powered news

Dylan Sam

@dsam99.bsky.social

1.1K followers 110 following 3 posts

Machine Learning PhD Student at CMU | Student Researcher at Google | dsam99.github.io

Posts Replies Media Videos

Dylan Sam

@dsam99.bsky.social

A very interesting paper with insights into understanding when and why synthetic data (although imperfect and biased) can boost the performance of statistical inference!! 📈

Emily Byun @yewonbyun.bsky.social · Oct 10

💡Can we trust synthetic data for statistical inference?

We show that synthetic data (e.g., LLM simulations) can significantly improve the performance of inference tasks. The key intuition lies in the interactions between the moment residuals of synthetic data and those of real data

October 10, 2025 at 5:44 PM

Reposted by Dylan Sam

Yuda Song

@yus167.bsky.social

LLM self-improvement has critical implications in synthetic data, post-training and test-time inference. To understand LLMs' true capability of self-improvement, we perform large-scale experiments with multiple families of LLMs, tasks and mechanisms. Here is what we found: (1/9)

December 6, 2024 at 6:02 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news