Robin Ranjit Singh Chauhan
banner
robinchauhan.bsky.social
Robin Ranjit Singh Chauhan
@robinchauhan.bsky.social
Host TalkRL Podcast, Aspiring RL researcher
AgFunder VC Head of Eng, Ex-MSFT, Waterloo computer engineering
Sunshine Coast BC Canada
E71: Jake Beck, Alex Goldie, & Cornelius Braun on Sutton's OaK, Metalearning, LLMs, Squirrels at @rl-conference.bsky.social
A few thoughts with @jakeabeck.bsky.social, @alexgoldie.bsky.social and @corneliusbraun.bsky.social
after Rich Sutton's fascinating lecture on his OaK architecture at UofA
August 26, 2025 at 5:42 AM
E69: Thomas Akam on Model-based RL in the Brain
@ox.ac.uk neuroscientist Prof Akam on RL in brains vs machines, dopamine as TD-error (or not), hippocampal replay & Dyna, model-free vs model-based myths, and why ML experts should consider neuroscience careers.
August 26, 2025 at 5:38 AM
E67: Stefano Albrecht on Multi-Agent RL @ RLDM 2025
Stefano Albrecht shares the story behind his multi-agent RL textbook, and how DeepFlow AI turns these ideas into action with LLM-powered agents for business automation.
Recorded at
@rldmdublin2025.bsky.social
August 26, 2025 at 5:35 AM
Rich Sutton presenting his OaK architecture at @rl-conference.bsky.social
August 9, 2025 at 8:05 PM
August 9, 2025 at 8:04 PM
August 9, 2025 at 2:42 AM
E66: Satinder Singh: The Origin Story of RLDM @ RLDM 2025

Professor Satinder Singh of Google DeepMind and U of Michigan is co-founder of ‪@rldmdublin2025.bsky.social‬
Here he narrates the origin story of the Reinforcement Learning and Decision Making meeting (not conference).
June 25, 2025 at 4:33 PM
TalkRL Podcast at @rldmdublin2025.bsky.social in Dublin Ireland!
June 12, 2025 at 4:45 PM
E65: NeurIPS 2024 – Posters and Hallways 3

- Claire Bizon Monroc of Inria : WFCRL for Wind Farm Control
Andrew Wagenmaker of @ucberkeleyofficial.bsky.social : Leveraging Simulation to Bridge Sim-to-Real Gap
- @harwiltz.bsky.social of @mila-quebec.bsky.social : Multivariate Distributional RL
(cont)
March 10, 2025 at 5:21 PM
BSKY
E64: NeurIPS 2024 – Posters and Hallways 2

- Jonathan Cook of Oxford: Cultural Accumulation in Reinforcement Learning
- Yifei Zhou of Berkeley AI Research: DigiRL for In-The-Wild Device-Control Agents
- Rory Young of U Glasgow: A Lyapunov Exponent Approach to RL Robustness
(cont'd)
March 6, 2025 at 5:09 AM
E63: NeurIPS 2024 - Posters and Hallways 1

Jiaheng Hu of UTexas on Unsupervised Skill Discovery for HRL
@skandermoalla.bsky.social of EPFL: Representation and Trust in PPO
Adil Zouitine of IRT Saint Exupery/Hugging Face : Time-Constrained Robust MDPs
March 3, 2025 at 2:06 PM
E62: Abhishek Naik on Continuing RL & Average Reward
How should RL handle non-episodic tasks, like Mars rovers or HPC scheduling? @abhisheknaik96.bsky.social shares insights from his PhD with Rich Sutton, plus how almost every discounted-reward algorithm can be improved by reward centering.
February 10, 2025 at 4:35 PM
ty for doing this!
$50 for guinea worm
December 30, 2024 at 6:40 PM
Infinitesimus and centesimal also fun
December 30, 2024 at 7:21 AM
Theme frequency from the crowd
December 25, 2024 at 4:56 PM
E61: Neurips 2024 RL meetup Hot takes: "What sucks about RL?"
What do RL researchers complain about after hours at the bar?  In this "Hot takes" episode, we find out!  
Recorded at The Pearl in downtown Vancouver, during the RL meetup after a day of Neurips 2024.
December 25, 2024 at 4:50 PM