Daphne Cornelisse
daphne-cornelisse.bsky.social
Daphne Cornelisse
@daphne-cornelisse.bsky.social
PhD student at NYU | Building human-like agents | https://www.daphne-cornelisse.com/
Reposted by Daphne Cornelisse
Excited to share a new preprint, accepted as a spotlight at #NeurIPS2025!

Humans are imperfect decision-makers, and autonomous systems should understand how we deviate from idealized rationality

Our paper aims to address this! 👀🧠✨
arxiv.org/abs/2510.25951

a 🧵⤵️
Estimating cognitive biases with attention-aware inverse planning
People's goal-directed behaviors are influenced by their cognitive biases, and autonomous systems that interact with people should be aware of this. For example, people's attention to objects in their...
arxiv.org
November 13, 2025 at 1:20 PM
Rapid RL experimentation is great. But how do you catch silent errors before they slip by?

In this post, I share tools and habits that help me move quickly from idea to result without sacrificing reliability.
How to catch subtle RL bugs before they catch you
Tools and habits for reliable, fast RL experimentation and development
open.substack.com
October 13, 2025 at 11:29 AM
Reposted by Daphne Cornelisse
The single biggest epistemic challenge in the internet era is remaining calibrated about what "normal" people think while the internet throws up an infinite wall of crazy. Thousands of people sharing an absurd opinion on the internet tells you very little!
September 8, 2025 at 6:43 PM
Overnight runs are the overnight oats of research — prep, forget, and rewarding by morning
April 19, 2025 at 12:44 AM
Reposted by Daphne Cornelisse
Building a "human-level" simulated driver that zero-shot generalizes to many benchmarks: a fun interview with @natolambert.bsky.social
www.youtube.com/watch?v=2Q66...
Self-play for Self-driving and where Scaling Reinforcement Learning is Heading with Eugene Vinitsky
YouTube video by Interconnects AI
www.youtube.com
March 12, 2025 at 7:19 PM
Sim agents are key for developing autonomous systems for safety-critical systems, like self-driving cars.

We're open-sourcing sim agents that achieve a 99.8% success rate with < 0.8% failures on the Waymo Dataset. These agents are built through scaling self-play.
February 28, 2025 at 5:19 PM
GPUDrive got accepted to ICLR 2025!

With that, we release GPUDrive v0.4.0! 🚨 You can now install the repo and run your first fast PPO experiment in under 10 minutes.

I’m honestly so excited about the new opportunities and research the sim makes possible. 🚀 1/2
February 20, 2025 at 6:53 PM
Reposted by Daphne Cornelisse
A large group of us (spearheaded by Denizalp Goktas) have put out a position paper on paths towards foundation models for strategic decision-making. Language models still lack these capabilities so we'll need to build them: hal.science/hal-04925309...
February 18, 2025 at 6:33 PM