Josh McClellan
joshmcclellan.bsky.social
I study generalization for reinforcement learning
Reposted by Josh McClellan
I'm looking to hire a student researcher to work on an exciting project for 6 months in DeepMind Montreal.

Requirements:
- Full-time masters/PhD student 🧑🏾‍🎓
- Substantial expertise in multi-agent RL, ideally including publication(s) 🤖🤖
- Strong Python coding skills 🐍

Is this you? Get in touch!
March 20, 2025 at 12:29 AM
Reposted by Josh McClellan
Super excited to share that our paper, Simplifying Deep Temporal Difference Learning, has been accepted as a spotlight at ICLR! My fab collaborator Matteo Gallici and I have written a three-part blog on the work, so stay tuned for that! :)
@flair-ox.bsky.social
arxiv.org/pdf/2407.04811
arxiv.org
March 18, 2025 at 11:48 AM
Reposted by Josh McClellan
NOETIX robot: 44lbs, <4 feet tall, 18 dof, Jetson on board. Starting at $5.5k. At this rate I am fairly convinced there will be robots absolutely everywhere within 5 years; although probably more form factors than just humanoids.
March 15, 2025 at 8:23 PM
Reposted by Josh McClellan
OpenAI has many problems, but I can think of few outcomes worse than Musk gaining control over it.

He will continue to drum up fear about rogue AI being an existential threat to justify his consolidation of power and use it to disenfranchise people.

finance.yahoo.com/news/elon-mu...
Elon Musk-led group makes $97.4 billion bid for control of OpenAI, WSJ reports
The offer intensifies a longstanding battle between OpenAI CEO Sam Altman and Musk over the future of the startup at the heart of a boom in generative AI technology. Musk's attorney, Marc Toberoff, s...
finance.yahoo.com
February 10, 2025 at 9:26 PM
Reposted by Josh McClellan
We've built a simulated driving agent that we trained on 1.6 billion km of driving with no human data.
It is SOTA on every planning benchmark we tried.
In self-play, it goes 20 years between collisions.
February 6, 2025 at 6:34 PM
Reposted by Josh McClellan
1. Today the NIH director issued a new directive slashing overhead rates to 15%.

I want to provide some context on what that means and why it matters.

grants.nih.gov/grants/guide...
NOT-OD-25-068: Supplemental Guidance to the 2024 NIH Grants Policy Statement: Indirect Cost Rates
NIH Funding Opportunities and Notices in the NIH Guide for Grants and Contracts: Supplemental Guidance to the 2024 NIH Grants Policy Statement: Indirect Cost Rates NOT-OD-25-068. OD
grants.nih.gov
February 8, 2025 at 12:18 AM
We will be presenting this tomorrow at NeurIPS in the evening poster session! Stop by to chat!
I'm excited to share that our paper, "Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via Equivariance," has been accepted to NeurIPS 2024! 🎉

#NeurIPS #MARL #AI #ReinforcementLearning #MachineLearning #Equivariance #GraphNeuralNetworks
December 13, 2024 at 3:05 AM
I'm excited to share that our paper, "Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via Equivariance," has been accepted to NeurIPS 2024! 🎉

#NeurIPS #MARL #AI #ReinforcementLearning #MachineLearning #Equivariance #GraphNeuralNetworks
December 6, 2024 at 3:20 PM