Late to the party (since I just took some time to spend with our two little ones) but luckily good science is timeless ;)
PQN Blog 1/3: TD methods are the bread and butter of RL, yet can have convergence issues when used in practice. This has always annoyed me. Find out below why TD is so unstable and how can we understand this instability better using the TD Jacobian. @flair-ox.bsky.social @jfoerst.bsky.social
Fixing TD Pt I: Why is Temporal Difference Learning so Unstable?
blog.foersterlab.com
May 2, 2025 at 8:13 PM
Late to the party (since I just took some time to spend with our two little ones) but luckily good science is timeless ;)
PQN puts Q-learning back on the map and now comes with a blog post + Colab demo! Also, congrats to the team for the spotlight at #ICLR2025
PQN blog 3/3 👉take a look at Matteo's 5-minute blog covering PQN’s key features, plus a Colab demo with JAX & PyTorch implementations mttga.github.io/posts/pqn/
🔎 For a deeper dive into the theory:
blog.foersterlab.com/fixing-td-pa...
blog.foersterlab.com/fixing-td-pa...
See you in Singapore! 🇸🇬
🔎 For a deeper dive into the theory:
blog.foersterlab.com/fixing-td-pa...
blog.foersterlab.com/fixing-td-pa...
See you in Singapore! 🇸🇬
Simplifying Deep Temporal Difference Learning
A modern implementation of Deep Q-Network without target networks and replay buffers.
mttga.github.io
March 20, 2025 at 11:51 AM
PQN puts Q-learning back on the map and now comes with a blog post + Colab demo! Also, congrats to the team for the spotlight at #ICLR2025
My group @FLAIR_Ox is recruiting a postdoc and looking for someone who can get started by the end of April. Deadline to apply is in one week (!), 19th of March at noon, so please help spread the word: my.corehr.com/pls/uoxrecru...
Job Details
my.corehr.com
March 12, 2025 at 3:17 PM
My group @FLAIR_Ox is recruiting a postdoc and looking for someone who can get started by the end of April. Deadline to apply is in one week (!), 19th of March at noon, so please help spread the word: my.corehr.com/pls/uoxrecru...
Reposted by Jakob Foerster
That's the first time that I see a video by chess.com cited in an accepted ICLR paper, in particular on handshakes vs. fist bumps during a chess competition ...
By Oxford, @jfoerst.bsky.social
Paper: openreview.net/forum?id=wFg...
Video: www.youtube.com/watch?v=6fS7...
@danielrensch.chess.com
By Oxford, @jfoerst.bsky.social
Paper: openreview.net/forum?id=wFg...
Video: www.youtube.com/watch?v=6fS7...
@danielrensch.chess.com
February 9, 2025 at 1:40 PM
That's the first time that I see a video by chess.com cited in an accepted ICLR paper, in particular on handshakes vs. fist bumps during a chess competition ...
By Oxford, @jfoerst.bsky.social
Paper: openreview.net/forum?id=wFg...
Video: www.youtube.com/watch?v=6fS7...
@danielrensch.chess.com
By Oxford, @jfoerst.bsky.social
Paper: openreview.net/forum?id=wFg...
Video: www.youtube.com/watch?v=6fS7...
@danielrensch.chess.com
Reposted by Jakob Foerster
@jfoerst.bsky.social take on how the community sees the ARC Challenge and how we evaluate models and use benchmarks nowadays is 👌.
#more_science_less_hype (please).
PS: Amazing discussion and good brain food, as usual with MLST.
#more_science_less_hype (please).
PS: Amazing discussion and good brain food, as usual with MLST.
ImageNet Moment for Reinforcement Learning?
YouTube video by Machine Learning Street Talk
www.youtube.com
February 18, 2025 at 7:26 PM
@jfoerst.bsky.social take on how the community sees the ARC Challenge and how we evaluate models and use benchmarks nowadays is 👌.
#more_science_less_hype (please).
PS: Amazing discussion and good brain food, as usual with MLST.
#more_science_less_hype (please).
PS: Amazing discussion and good brain food, as usual with MLST.
Reposted by Jakob Foerster
Second #runconference @neuripsconf.bsky.social #NeurIPS2024 !
@jfoerst.bsky.social @ferranalet.bsky.social @adamjelley.bsky.social @enjeeneer.io
Same deal for tomorrow: 7am at
goo.gl/maps/8Z8eMrd...
Join us!
@jfoerst.bsky.social @ferranalet.bsky.social @adamjelley.bsky.social @enjeeneer.io
Same deal for tomorrow: 7am at
goo.gl/maps/8Z8eMrd...
Join us!
December 11, 2024 at 6:25 PM
Second #runconference @neuripsconf.bsky.social #NeurIPS2024 !
@jfoerst.bsky.social @ferranalet.bsky.social @adamjelley.bsky.social @enjeeneer.io
Same deal for tomorrow: 7am at
goo.gl/maps/8Z8eMrd...
Join us!
@jfoerst.bsky.social @ferranalet.bsky.social @adamjelley.bsky.social @enjeeneer.io
Same deal for tomorrow: 7am at
goo.gl/maps/8Z8eMrd...
Join us!
🚨 PSA 🚨 Deadline to apply for your dream Phd in ML
@FLAIR_Ox
is coming up on the 2nd of December AOE. We work on compute-only scaling of LLMs, (meta/multi-agent) RL at the Hyperscale, Human-AI coordination, opponent-shaping for vaccine design, GenAI for finance & much more..
@FLAIR_Ox
is coming up on the 2nd of December AOE. We work on compute-only scaling of LLMs, (meta/multi-agent) RL at the Hyperscale, Human-AI coordination, opponent-shaping for vaccine design, GenAI for finance & much more..
DPhil in Engineering Science | University of Oxford
About the courseThe DPhil in Engineering Science will offer you the opportunity to develop in-depth knowledge, understanding and expertise in your chosen field of engineering research. To support
ox.ac.uk
November 29, 2024 at 7:45 PM
🚨 PSA 🚨 Deadline to apply for your dream Phd in ML
@FLAIR_Ox
is coming up on the 2nd of December AOE. We work on compute-only scaling of LLMs, (meta/multi-agent) RL at the Hyperscale, Human-AI coordination, opponent-shaping for vaccine design, GenAI for finance & much more..
@FLAIR_Ox
is coming up on the 2nd of December AOE. We work on compute-only scaling of LLMs, (meta/multi-agent) RL at the Hyperscale, Human-AI coordination, opponent-shaping for vaccine design, GenAI for finance & much more..
wth did we not go to an open-source and non-for profit alternative? en.wikipedia.org/wiki/Bluesky
November 23, 2024 at 3:00 PM
wth did we not go to an open-source and non-for profit alternative? en.wikipedia.org/wiki/Bluesky
Hello BlueSky! Joao Henriques (joao.science) and I are hiring a fully funded PhD student (UK/international) for the FAIR-Oxford program. The student will spend 50% of their time @UniofOxford and 50% @MetaAI (FAIR) in London, while completing a DPhil (Oxford PhD). Deadline: 2nd of Dec AOE!!
João F. Henriques
Research of Joao F. Henriques
joao.science
November 23, 2024 at 2:35 PM
Hello BlueSky! Joao Henriques (joao.science) and I are hiring a fully funded PhD student (UK/international) for the FAIR-Oxford program. The student will spend 50% of their time @UniofOxford and 50% @MetaAI (FAIR) in London, while completing a DPhil (Oxford PhD). Deadline: 2nd of Dec AOE!!