Our new #ICML2025 paper tackles this question from first principles, and finds a surprising answer, agents _are_ world models… 🧵
arxiv.org/abs/2506.01622
Our new #ICML2025 paper tackles this question from first principles, and finds a surprising answer, agents _are_ world models… 🧵
arxiv.org/abs/2506.01622
"...closed-loop and open-loop training produce fundamentally different learning dynamics, even when using identical architectures and converging to the same final solution."
arxiv.org/abs/2505.13567
"...closed-loop and open-loop training produce fundamentally different learning dynamics, even when using identical architectures and converging to the same final solution."
arxiv.org/abs/2505.13567