Models of human & robot decision making in complex environments, including video games and urban driving. https://www.momchiltomov.com/
The IRL network is trained on many hours of human export demonstrations to effectively reverse-engineer the intrinsic reward function of human driving.
The IRL network is trained on many hours of human export demonstrations to effectively reverse-engineer the intrinsic reward function of human driving.
The main innovation is to reporpose MCTS to ouput a *set of possible sequences* of actions (i.e., trajectories).
The main innovation is to reporpose MCTS to ouput a *set of possible sequences* of actions (i.e., trajectories).
**TreeIRL** is a novel planner that combines classical search with learning-based methods to achieve state-of-the-art performance in simulation and in **real-world autonomous driving**! 🚘 🤖 🚀
**TreeIRL** is a novel planner that combines classical search with learning-based methods to achieve state-of-the-art performance in simulation and in **real-world autonomous driving**! 🚘 🤖 🚀