Lightnews — Scholar-powered news

Reposted by Dylan Foster 🐢

Chris Paxton

@cpaxton.bsky.social

New work on why action chunking is so important for robot control (it helps fight compounding error)

arxiv.org/abs/2507.09061

December 16, 2025 at 11:55 PM

Reposted by Dylan Foster 🐢

Nathan Lambert

@natolambert.bsky.social

Building Olmo 3 Think
Foundations of Reasoning in Language Models @ NeurIPS 2025
Today 13:45 - 14:30

December 7, 2025 at 3:26 PM

Reposted by Dylan Foster 🐢

let-all.com

@let-all.com

At #NeurIPS2025? Join us for a Social on Wednesday at 7 PM, featuring a fireside chat with Jon Kleinberg and mentoring tables.

Ft. mentors @djfoster.bsky.social @surbhigoel.bsky.social @aifi.bsky.social @gautamkamath.com and more!

November 26, 2025 at 8:47 PM

Dylan Foster 🐢

@djfoster.bsky.social

The coverage principle: How pre-training enables post-training

New preprint where we look at the mechanisms through which next-token prediction produces models that succeed at downstream tasks.

The answer involves a metric we call the "coverage profile", not cross-entropy.

October 25, 2025 at 4:20 PM

Reposted by Dylan Foster 🐢

Aviad Rubinstein

@aviad-rubinstein.bsky.social

The new call for Motwani postdocs application is now open!
academicjobsonline.org/ajo/jobs/30865

BTW-

Not quite ready for a postdoc? We updated the TCS Masters programs spreadsheet:
www.cs.princeton.edu/~smattw/mast...

Any career stage and in the (SF) Bay Area?
Save the date for TOCA-SV on 11/7!

Stanford University, Computer Science/Theory Lab/Stanford University

Job #AJO30865, Postdoc in Theoretical Computer Science at Stanford, Computer Science/Theory Lab/Stanford University, Stanford University, Stanford, California, US

academicjobsonline.org

October 13, 2025 at 8:42 PM

Dylan Foster 🐢

@djfoster.bsky.social

Taming Imperfect Process Verifiers: A Sampling Perspective on Backtracking.

A totally new framework based on ~backtracking~ for using process verifiers to guide inference, w/ connections to approximate counting/sampling in theoretical CS.

Paper: www.arxiv.org/abs/2510.03149

October 12, 2025 at 4:26 PM

Dylan Foster 🐢

@djfoster.bsky.social

MSR NYC is hiring spring and summer interns in AI/ML/RL!

Apply here: jobs.careers.microsoft.com/global/en/jo...

Microsoft Research Lab - New York City - Microsoft Research

Apply for a research position at Microsoft Research New York & collaborate with academia to advance economics research, prediction markets & ML.

www.microsoft.com

October 2, 2025 at 8:58 PM

Reposted by Dylan Foster 🐢

Miro Dudik

@mdudik.bsky.social

🚨Microsoft Research NYC is hiring🚨

We're hiring postdocs and senior researchers in AI/ML broadly, and in specific areas like test-time scaling and science of DL. Postdoc applications due Oct 22, 2025. Senior researcher applications considered on a rolling basis.

Links to apply: aka.ms/msrnyc-jobs

September 18, 2025 at 2:37 PM

Dylan Foster 🐢

@djfoster.bsky.social

Microsoft Research New York City (www.microsoft.com/en-us/resear...) is seeking applicants for multiple Postdoctoral Researcher positions in ML/AI!

These are positions for up to 2 years, starting in July 2026.

Application deadline: October 22, 2025

September 12, 2025 at 2:57 PM

Dylan Foster 🐢

@djfoster.bsky.social

Quick reminder: The deadline for our workshop on Foundations of Reasoning in Language Models (FoRLM) at NeurIPS 2025 is next Wednesday, Sept 3!

Dylan Foster 🐢 @djfoster.bsky.social · Aug 11

Announcing the first workshop on Foundations of Language Model Reasoning (FoRLM) at NeurIPS 2025!

📝Soliciting abstracts that advance foundational understanding of reasoning in language models, from theoretical analyses to rigorous empirical studies.

📆 Deadline: Sept 3, 2025

August 27, 2025 at 6:51 PM

Dylan Foster 🐢

@djfoster.bsky.social

Announcing the first workshop on Foundations of Language Model Reasoning (FoRLM) at NeurIPS 2025!

📝Soliciting abstracts that advance foundational understanding of reasoning in language models, from theoretical analyses to rigorous empirical studies.

📆 Deadline: Sept 3, 2025

August 11, 2025 at 3:40 PM

Dylan Foster 🐢

@djfoster.bsky.social

For those at ICML, Audrey will be presenting this paper at the 4:30pm poster session this afternoon! West Exhibition Hall B2-B3 W-1009

July 15, 2025 at 3:59 PM

Reposted by Dylan Foster 🐢

Gautam Kamath

@gautamkamath.com

ICML's election for their board of directors has begun. I've thrown my hat in the ring. Please consider voting for Gautam Kamath.

I have experience with the governance of TMLR, COLT, and ALT, and I think I've demonstrated myself as a conscientious and engaged community member.

June 30, 2025 at 12:44 PM

Reposted by Dylan Foster 🐢

Tom Silver

@tomssilver.bsky.social

This week's #PaperILike is "The Power of Resets in Online Reinforcement Learning" (Mhammedi et al., 2024).

If you're doing RL in sim, why not use the sim to its full potential? Reset to any state! (gym.Env.reset() is not all we need.)

PDF: arxiv.org/abs/2404.15417

The Power of Resets in Online Reinforcement Learning

Simulators are a pervasive tool in reinforcement learning, but most existing algorithms cannot efficiently exploit simulator access -- particularly in high-dimensional domains that require general fun...

arxiv.org

June 29, 2025 at 1:08 PM

Reposted by Dylan Foster 🐢

let-all.com

@let-all.com

📣Join us at COLT 2025 in Lyon for a community event!
📅When: Mon, June 30 | 16:00 CET
What: Fireside chat w/ Peter Bartlett & Vitaly Feldman on communicating a research agenda, followed by mentorship roundtable to practice elevator pitches & mingle w/ COLT community!
let-all.com/colt25.html

June 24, 2025 at 6:22 PM

Reposted by Dylan Foster 🐢

Eugene Vinitsky 🍒

@eugenevinitsky.bsky.social

Hiring a postdoc to scale up and deploy RL-based planning onto some self-driving cars! We'll be building on arxiv.org/abs/2502.03349 and learning what the limits and challenges of RL planning are. Shoot me a message if interested and help spread the word please!

Full posting to come in a bit.

Robust Autonomy Emerges from Self-Play

Self-play has powered breakthroughs in two-player and multi-player games. Here we show that self-play is a surprisingly effective strategy in another domain. We show that robust and naturalistic drivi...

arxiv.org

June 21, 2025 at 5:14 PM

Reposted by Dylan Foster 🐢

Jason Hartline

@jasonhartline.bsky.social

At the IDEAL annual meeting, where I saw this paper presented. Basically: it reduces the length of chain-of-thought LLM computations by deleting intermediate computations, more like classical functional programming, where only the function call and return values are important.

arxiv.org/abs/2503.14337

PENCIL: Long Thoughts with Short Memory

While recent works (e.g. o1, DeepSeek R1) have demonstrated great promise of using long Chain-of-Thought (CoT) to improve reasoning capabilities of language models, scaling it up during test-time is c...

arxiv.org

June 9, 2025 at 2:54 PM

Reposted by Dylan Foster 🐢

Clément Canonne

@ccanonne.github.io

RADEMACHER CHAOS 🤘

June 4, 2025 at 1:27 PM

Dylan Foster 🐢

@djfoster.bsky.social

Dhruv Rohatgi will be giving a lecture on our recent work on comp-stat tradeoffs in next-token prediction at the RL Theory virtual seminar series (rl-theory.bsky.social) tomorrow at 2pm EST! Should be a fun talk---come check it out!!

May 26, 2025 at 7:19 PM

Reposted by Dylan Foster 🐢

RL Theory Virtual Seminars

@rl-theory.bsky.social

Later today, Sikata and Marcel will talk about their recent work on oracle-efficient RL with ensembles. Join us!

May 20, 2025 at 3:48 PM

Dylan Foster 🐢

@djfoster.bsky.social

The abstract submission deadline for FoPT has been extended to the 21st of May (11:59pm UTC).

Submission website: openreview.net/group?id=lea...

Dylan Foster 🐢 @djfoster.bsky.social · May 9

Announcing the first workshop on Foundations of Post-Training (FoPT) at COLT 2025!

📝 Soliciting abstracts/posters exploring theoretical & practical aspects of post-training and RL with language models!

🗓️ Deadline: May 19, 2025

May 19, 2025 at 6:27 PM

Dylan Foster 🐢

@djfoster.bsky.social

Announcing the first workshop on Foundations of Post-Training (FoPT) at COLT 2025!

📝 Soliciting abstracts/posters exploring theoretical & practical aspects of post-training and RL with language models!

🗓️ Deadline: May 19, 2025

May 9, 2025 at 5:10 PM

Dylan Foster 🐢

@djfoster.bsky.social

Is Best-of-N really the best we can do for language model inference?

New paper (appearing at ICML) led by the amazing Audrey Huang (ahahaudrey.bsky.social) with Adam Block, Qinghua Liu, Nan Jiang, and Akshay Krishnamurthy (akshaykr.bsky.social).

1/11

May 3, 2025 at 5:40 PM

Reposted by Dylan Foster 🐢

RL Theory Virtual Seminars

@rl-theory.bsky.social

Last seminars before the summer break:

04/29: Max Simchowitz (CMU)
05/06: Jeongyeol Kwon (Univ. of Wisconsin-Madison)
05/20: Sikata Sengupta & Marcel Hussing (Univ. of Pennsylvania)
05/27: Dhruv Rohatgi (MIT)
06/03: David Janz (Univ. of Oxford)
06/10: Nneka Okolo (MIT)

April 16, 2025 at 5:20 PM
