Momchil Tomov
momchiltomov.bsky.social
Momchil Tomov
@momchiltomov.bsky.social
Cognitive Neuroscientist @ Harvard, AI Researcher @ Motional

Models of human & robot decision making in complex environments, including video games and urban driving. https://www.momchiltomov.com/
Here are several examples of real-world cut-ins. TreeIRL anticipates the cut-in and brakes comfortably, while the other baselines either brake too late or brake uncomfortably (see inset history of vehicle kinematics).
September 18, 2025 at 3:49 PM
Tree achieves 1-2 orders of magnitude improvement in safety, while also improving comfort and progress! On the road, it is by far the best planner.
September 18, 2025 at 3:48 PM
We feed the MCTS trajectories into a deep scoring function trained with IRL to choose the most human-like among them.

The IRL network is trained on many hours of human export demonstrations to effectively reverse-engineer the intrinsic reward function of human driving.
September 18, 2025 at 3:48 PM
MCTS uses search + ML to efficiently explore combinatorially large search spaces. In most applications (e.g. AlphaGo), MCTS outputs a single next best action.

The main innovation is to reporpose MCTS to ouput a *set of possible sequences* of actions (i.e., trajectories).
September 18, 2025 at 3:47 PM
Excited to share a new preprint based on my work this past year:

**TreeIRL** is a novel planner that combines classical search with learning-based methods to achieve state-of-the-art performance in simulation and in **real-world autonomous driving**! 🚘 🤖 🚀
September 18, 2025 at 3:39 PM