Miguel Suau
miguelsuau.bsky.social
Machine Teacher. Research Scientist at Phaidra. PhD from TU Delft. Previously JP Morgan, Huawei, Unity.

https://www.suau.io/
Excited to be attending EWRL again this year! I'll be giving a talk on Thursday (Sep 18) about my work on policy confounding.
🚀 Less than one week to EWRL 2025!

👀 Did you check the program? euro-workshop-on-reinforcement-learning.github.io/ewrl18/progr...
🎟️ Register here: site.pheedloop.com/event/EWRL/h...

Can’t wait to meet you all in person! #EWRL2025
September 13, 2025 at 9:07 AM
Reposted by Miguel Suau
📣 Early bird registration ends today!
Register and join us in Tübingen for EWRL 2025: site.pheedloop.com/event/EWRL/h...
September 3, 2025 at 7:44 AM
Excited to share my new preprint, 'Breaking Habits: On the Role of the Advantage Function in Learning Causal State Representations,' which I presented last week at @rldmdublin2025.bsky.social.
Link: arxiv.org/abs/2506.11912
June 18, 2025 at 7:55 PM
Phaidra is hiring a Research Scientist to work on sequential decision-making problems. I'm at the RLDM conference in Dublin this week. If you're attending and would like to learn more about the role or the company, feel free to reach out!
job-boards.greenhouse.io/phaidra/jobs...
AI Research Scientist (Sequential Decision Making)
Remote
June 11, 2025 at 10:40 AM
Reposted by Miguel Suau
AI benchmarking culture is completely out of control. Tables with dozens of methods, datasets, and bold numbers, trying to answer a question that perhaps no one should be asking anymore.
May 30, 2025 at 9:55 PM
Reposted by Miguel Suau
🚨🚨 RLC deadline has been extended by a week! Abstract deadline is Feb. 21 with a paper deadline of Feb. 28 🚨🚨. Please spread the word!
February 8, 2025 at 6:05 PM
Reposted by Miguel Suau
We've built a simulated driving agent that we trained on 1.6 billion km of driving with no human data.
It is SOTA on every planning benchmark we tried.
In self-play, it goes 20 years between collisions.
February 6, 2025 at 6:34 PM
Reposted by Miguel Suau
RLC 2025 is looking for reviewers and reviewer nominations, for folks looking to innovate on the RL reviewing process. If you know someone qualified, please nominate them (but read the docs below): forms.gle/3yCeBjn4Yhi7...
And please help us spread the word!
January 31, 2025 at 5:08 PM
Reposted by Miguel Suau
I have a draft of my introduction to cooperative multi-agent reinforcement learning on arxiv. Check it out and let me know any feedback you have. The plan is to polish and extend the material into a more comprehensive text with Frans Oliehoek.

arxiv.org/abs/2405.06161
A First Introduction to Cooperative Multi-Agent Reinforcement Learning
Multi-agent reinforcement learning (MARL) has exploded in popularity in recent years. While numerous approaches have been developed, they can be broadly categorized into three main types: centralized ...
January 7, 2025 at 4:25 PM
Reposted by Miguel Suau
If you're at NeurIPS, RLC is hosting an RL event from 8 till late at The Pearl on Dec. 11th. Join us, meet all the RL researchers, and spread the word!
December 10, 2024 at 9:55 PM
Hello, Bluesky! The entire Phaidra research team is excited to be attending #NeurIPS2024 this year. I arrived in Canada early to enjoy a few days of skiing in Whistler before the conference kicks off. If you’re attending and would like to connect, feel free to drop me a message!
December 8, 2024 at 11:41 PM