Senior Research Scientist, Google | PhD, Princeton University
TLDR: Stacking, i.e., growing model depth gradually, not only improves training efficiency (if done right) but also significantly improves downstream tasks that require *reasoning*, at similar perplexity. 1/n
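To make "growing model depth gradually" concrete, here is a minimal sketch of the stacking idea: train a shallow model, then periodically double its depth by copying the trained layers on top of themselves and continuing training. The layer counts, growth schedule, and deepcopy-based initialization here are illustrative assumptions, not the exact recipe from the work this thread describes.

```python
# Minimal stacking sketch (assumed details: depth schedule and
# copy-based growth are illustrative, not the authors' exact method).
import copy
import torch.nn as nn

class StackableLM(nn.Module):
    def __init__(self, d_model=256, n_heads=4, n_layers=2):
        super().__init__()
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.layers = nn.ModuleList(
            [copy.deepcopy(layer) for _ in range(n_layers)]
        )

    def grow(self):
        # Double the depth by stacking a copy of the current layers on top,
        # so the new upper half starts from the trained lower half's weights.
        self.layers.extend([copy.deepcopy(l) for l in self.layers])

    def forward(self, x):
        for layer in self.layers:
            x = layer(x)
        return x

model = StackableLM(n_layers=2)
# ... train for a while at depth 2 ...
model.grow()  # now depth 4; continue training, repeat until target depth
```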