dapatil211.bsky.social
@dapatil211.bsky.social
PhD student at Mila doing lifelong learning and nonstationary optimization
Reposted
Can better architectures & representations make self-play enough for zero-shot coordination? 🤔
We explore this in our ICLR 2025 paper: A Generalist Hanabi Agent. We develop R3D2, the first agent to master all Hanabi settings and generalize to novel partners! 🚀 #ICLR2025 1/n
April 4, 2025 at 5:12 PM