Lightnews — Scholar-powered news

awni.bsky.social

@awni.bsky.social

phd student @ yale statistics & data science

studying the foundations of machine intelligence

awni.xyz

awni.bsky.social

@awni.bsky.social

🧪 Theory meets Practice

We empirically validate our theory’s predictions in simple settings where the CoT information can be computed exactly.

We find that the theory closely predicts the sample-efficiency gains.
[8/n]

November 25, 2025 at 4:27 AM

awni.bsky.social

@awni.bsky.social

🧮 The Theory

To distinguish between hypotheses with error ε, classical theory tells us we need roughly O(1/ε) samples.

We prove that under CoT supervision, the sample complexity improves to O(1/CoTInfo(ε)).
[5/n]

Mathematical result on the sample complexity of learning with CoT supervision.

November 25, 2025 at 4:27 AM
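To make the two rates in the post above concrete, here is a minimal sketch of the arithmetic, not the paper's actual bound. It drops all constants, log factors, and hypothesis-class terms hidden by the O(·) notation. The function name `samples_needed` and the numeric value used for the CoT information are hypothetical, chosen only for illustration and assuming the CoT information exceeds ε in this case.

```python
import math


def samples_needed(rate: float) -> int:
    """Order-of-magnitude sample count ~ 1 / rate.

    Illustrative only: constants and log factors hidden by O(.) are dropped.
    """
    if rate <= 0:
        raise ValueError("rate must be positive")
    return math.ceil(1.0 / rate)


epsilon = 0.01   # target error
cot_info = 0.2   # hypothetical CoTInfo(epsilon); not a value from the paper

label_only = samples_needed(epsilon)   # classical rate, ~ 1/epsilon
with_cot = samples_needed(cot_info)    # CoT-supervised rate, ~ 1/CoTInfo(epsilon)

print(f"label-only supervision: ~{label_only} samples")
print(f"CoT supervision:        ~{with_cot} samples")
print(f"ratio:                  ~{label_only / with_cot:.1f}x")
```

The larger the CoT information is relative to ε, the larger the ratio; if the two were equal, this sketch would show no gain.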

awni.bsky.social

@awni.bsky.social

🧠 The Insight: CoT supervision doesn’t just tell the model what to predict; it constrains how it thinks.

We formalize this by introducing the “CoT Information”: a measure of the extra discriminative power gained by observing the reasoning trace, not just the label.
[4/n]

Mathematical definition of the CoT information.

November 25, 2025 at 4:27 AM

awni.bsky.social

@awni.bsky.social

🌟🔗 Spotlight #NeurIPS2025 Paper on the Foundations of Chain-of-Thought Learning 🔗🌟

Excited to share our work developing a learning-theoretic account of the statistical advantage of chain-of-thought supervision in reasoning systems!

Blog: awni.xyz/cot-info

👇🧵
[1/n]

A screenshot of the neurips conference page for this paper.

November 25, 2025 at 4:27 AM
