Lightnews — Scholar-powered news

Reposted by Taylor W. Killian

Eugene Vinitsky 🍒

@eugenevinitsky.bsky.social

The workshops are maybe the best part of RLC. Bring us your workshops that could never happen anywhere else!

Taylor W. Killian @twkillian.bsky.social · 1d

We're thrilled to share that the Call for Workshops for this year's @rl-conference.bsky.social is now live!

As Workshop co-chair (alongside the wonderful Raksha Kumaraswamy and @claireve.bsky.social) we are looking forward to seeing the proposals for workshops that we receive.

LINK IN NEXT POST

February 13, 2026 at 10:07 PM

Reposted by Taylor W. Killian

Eugene Vinitsky 🍒

@eugenevinitsky.bsky.social

3 weeks to get your RLC papers in shape! And to get the word out to those who have yet to experience an RLC review process.

Reinforcement Learning Conference @rl-conference.bsky.social · 2d

Quick reminder for everyone grinding on their RLC 2026 papers, only ~3 weeks to go!

The submission site opens in just a few days (Feb 17).

Deadlines:

⏳ March 1 (AoE): Abstract Submission
⏳ March 5 (AoE): Full Paper Submission

Good luck with the final changes!

Reinforcement Learning Conference @rl-conference.bsky.social · Dec 23

Hi RL Enthusiasts!

RLC is coming to Montreal, Quebec, in the summer: Aug 16–19, 2026!

Call for Papers is up now:
Abstract: Mar 1 (AOE)
Submission: Mar 5 (AOE)

Excited to see what you’ve been up to - Submit your best work!
rl-conference.cc/callforpaper...

Please share widely!

February 13, 2026 at 4:14 AM

Reposted by Taylor W. Killian

Igor Gilitschenski

@igilitschenski.bsky.social

🚀 Excited to share REPPO, a new on-policy RL agent!

TL;DR: Replace PPO with REPPO for fewer hyperparameter headaches and more robust training.

REPPO, led by @cvoelcker.bsky.social, will be presented at ICLR 2026. How does it work? 🧵👇

February 13, 2026 at 7:29 PM

Taylor W. Killian

@twkillian.bsky.social

We're thrilled to share that the Call for Workshops for this year's @rl-conference.bsky.social is now live!

As Workshop co-chair (alongside the wonderful Raksha Kumaraswamy and @claireve.bsky.social) we are looking forward to seeing the proposals for workshops that we receive.

LINK IN NEXT POST

February 13, 2026 at 9:50 PM

Taylor W. Killian

@twkillian.bsky.social

One paper accepted to ICML with one paper rejected as well. It’s called balance (and measured frustration) #icml2025

May 1, 2025 at 2:00 PM

Reposted by Taylor W. Killian

Eugene Vinitsky 🍒

@eugenevinitsky.bsky.social

Shipping a copy of Kuhn's "Structure of Scientific Revolutions" to every ICML meta-reviewer

May 1, 2025 at 12:59 PM

Taylor W. Killian

@twkillian.bsky.social

We're putting in final planning and preparation steps this week to be ready for #ICLR2025. I'm excited to be attending, in part to help represent @mbzuai.bsky.social and @llm360.bsky.social as well as recruit Research Scientists and Engineers to our new lab in the Bay Area! (see next for more info)

April 16, 2025 at 5:40 PM

Taylor W. Killian

@twkillian.bsky.social

I used to dream of days like this.

Mowed the lawn, trimmed some hedges, pruned a few trees to let more sun into our yard, picked the rest of our orange tree (we yielded 658 🍊 this year!), had the kids help clean up the branches, etc. #suburbandadsaturdays

March 22, 2025 at 8:33 PM

Reposted by Taylor W. Killian

Peter Henderson

@peterhenderson.bsky.social

Real-world RL for the public good! Love to see it!

Eugene Vinitsky 🍒 @eugenevinitsky.bsky.social · Mar 12

A year later of a multi-agent RL controlled variable speed limit system:
arxiv.org/abs/2503.01017
Fewer crashes, quicker responses, quicker warnings

Real-World Deployment and Assessment of a Multi-Agent Reinforcement Learning-Based Variable Speed Limit Control System

This article presents the first field deployment of a multi-agent reinforcement learning (MARL) based variable speed limit (VSL) control system on Interstate 24 (I-24) near Nashville, Tennessee. We de...

arxiv.org

March 12, 2025 at 4:20 AM

Reposted by Taylor W. Killian

Eugene Vinitsky 🍒

@eugenevinitsky.bsky.social

A year later of a multi-agent RL controlled variable speed limit system:
arxiv.org/abs/2503.01017
Fewer crashes, quicker responses, quicker warnings

Real-World Deployment and Assessment of a Multi-Agent Reinforcement Learning-Based Variable Speed Limit Control System

This article presents the first field deployment of a multi-agent reinforcement learning (MARL) based variable speed limit (VSL) control system on Interstate 24 (I-24) near Nashville, Tennessee. We de...

arxiv.org

March 12, 2025 at 2:14 AM

Taylor W. Killian

@twkillian.bsky.social

Reviewing a paper right now and I'm having lots of "I wish that I'd written this!" feelings. Great sign for the authors tbh.

March 12, 2025 at 4:18 AM

Reposted by Taylor W. Killian

John D. Martin

@jdmartin86.bsky.social

Congratulations Andy and Rich! You've given RL yet another place in the history books. I can't imagine how you're feeling right now, but it must be amazing to reflect on how far the ideas have come, especially after all the care and dedication you gave them.

awards.acm.org/about/2024-t...

awards.acm.org

March 5, 2025 at 9:20 PM

Taylor W. Killian

@twkillian.bsky.social

As I've started down the RL+LLM rabbit hole (for reals this time), this blog post by TensorZero is absolutely compelling. It's nice to see clear-thinking conceptual framing that at once opens the door to new research and leverages existing insights:
www.tensorzero.com/blog/think-o...

Think of LLM Applications as POMDPs — Not Agents · TensorZero

www.tensorzero.com

March 4, 2025 at 10:29 PM

Taylor W. Killian

@twkillian.bsky.social

Struggled to find a paper to recommend to authors that would strengthen their claims while writing a review. Used @GeminiApp and @youdotcom to no success. Reluctantly pulled up @ChatGPTapp, nailed it in the first returned result. #HesitantAboutLLMsButTryingToLearn

a man with a beard is saying he can 't keep getting away with it !

media.tenor.com

March 3, 2025 at 5:19 AM

Reposted by Taylor W. Killian

Eugene Vinitsky 🍒

@eugenevinitsky.bsky.social

"Scaling laws" tries to piggyback on physics where the scaling laws are resultant from statistical limits. To call them "laws" rather than "empirical scaling curves" is real iffy

February 28, 2025 at 6:41 PM

Taylor W. Killian

@twkillian.bsky.social

Day 1 with @mbzuai.bsky.social as we start building out a new non-profit, open research lab in the Bay Area. Lots to learn and my head is spinning but I’m excited to get started pushing the boundaries of what we know about reasoning and decision making under uncertainty.

February 24, 2025 at 11:22 PM

Reposted by Taylor W. Killian

Eugene Vinitsky 🍒

@eugenevinitsky.bsky.social

We've built a simulated driving agent that we trained on 1.6 billion km of driving with no human data.
It is SOTA on every planning benchmark we tried.
In self-play, it goes 20 years between collisions.

February 6, 2025 at 6:34 PM

Reposted by Taylor W. Killian

Bernhard Jaeger

@bernhard-jaeger.bsky.social

arxiv.org/abs/2502.03349
Awesome planning results by Vladlen Koltun's new lab (@twkillian.bsky.social, @eugenevinitsky.bsky.social @senerozan.bsky.social and others) that everyone working on driving should read.
Lots of details are in the Appendix.

Robust Autonomy Emerges from Self-Play

Self-play has powered breakthroughs in two-player and multi-player games. Here we show that self-play is a surprisingly effective strategy in another domain. We show that robust and naturalistic drivi...

arxiv.org

February 6, 2025 at 4:14 PM

Taylor W. Killian

@twkillian.bsky.social

📍Provo, Utah

February 4, 2025 at 12:41 AM

Taylor W. Killian

@twkillian.bsky.social

This afternoon I've been putting finishing touches on a talk that I've been invited to give at my undergrad Alma Mater (BYU Math) next Tues.

Since graduating, I've been hoping for an opportunity to return & share how my career has grown from the solid foundation I gained there. I'm really excited!

January 30, 2025 at 10:31 PM

Reposted by Taylor W. Killian

Eugene Vinitsky 🍒

@eugenevinitsky.bsky.social

Last year's workshops were some of the most unique, weird, exciting ones I've ever attended, from "Finding the Frame" to "RL Beyond Rewards" to "Deployable RL". I'm so excited for this year's unexpected entrants.

Reinforcement Learning Conference @rl-conference.bsky.social · Jan 29

RLC call for workshops is out rl-conference.cc/callforworks...!
Submissions open on Feb 3rd, deadline on March 7, with the workshops on Aug. 5th. Last year's workshops were inimitable and we (@claireve.bsky.social and Josiah Hanna) look forward to your amazing proposals

RLJ | RLC Call for Workshops

rl-conference.cc

January 29, 2025 at 4:07 PM

Taylor W. Killian

@twkillian.bsky.social

There's not much that is as disappointing as a scam call when you're expecting to hear from someone with a number you don't know...

January 28, 2025 at 10:00 PM

Taylor W. Killian

@twkillian.bsky.social

My bound and published PhD thesis arrived in the mail today. It's an amazing and overwhelming feeling to hold the product of 5 years of work in my hands.

January 24, 2025 at 11:33 PM

Taylor W. Killian

@twkillian.bsky.social

It's that time of year where reviewer invitations start piling up. I'm excited to contribute to the success of our research venues and am especially looking forward to the submissions I get the chance to review, especially those coming from @rl-conference.bsky.social!

January 22, 2025 at 6:42 AM

Reposted by Taylor W. Killian

Eugene Vinitsky 🍒

@eugenevinitsky.bsky.social

My actual favorite niche starter pack here:
go.bsky.app/2Gibu1a

January 20, 2025 at 8:17 PM
