We introduce MMBench, a 200-task RL benchmark, and Newt, a language-conditioned multitask world model trained with large-scale online RL.
www.nicklashansen.com/NewtWM/
Code, checkpoints, dataset, etc. are open-source!