gidsy.bsky.social
gidsy.bsky.social
@gidsy.bsky.social
Reposted by gidsy.bsky.social
Long H Dang, David Rawlinson: HRM-Agent: Training a recurrent reasoning model in dynamic environments using reinforcement learning https://arxiv.org/abs/2510.22832 https://arxiv.org/pdf/2510.22832 https://arxiv.org/html/2510.22832
October 28, 2025 at 6:29 AM