🔍Eau De Q-Network gradually prunes the network weights at the agent's learning pace, ultimately reaching a final sparsity level that is discovered by the algorithm!🔎
👉📰 arxiv.org/pdf/2503.01437
Iterated $Q$-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning
Théo Vincent, Daniel Palenicek, Boris Belousov, Jan Peters, Carlo D'Eramo
https://openreview.net/forum?id=Lt2H8Bd8jF
#reinforcement #iterative #iterations
Iterated $Q$-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning
Théo Vincent, Daniel Palenicek, Boris Belousov, Jan Peters, Carlo D'Eramo
https://openreview.net/forum?id=Lt2H8Bd8jF
#reinforcement #iterative #iterations
I had the pleasure of presenting my research as a Contributed Talk 🎉
Special thanks to the organizers for making it happen!
I had the pleasure of presenting my research as a Contributed Talk 🎉
Special thanks to the organizers for making it happen!
I will be presenting 4 posters. Feel free to come and exchange with me during the conference, at the Finding the Frame workshop, or at the Inductive Biases workshop🙂
I will be presenting 4 posters. Feel free to come and exchange with me during the conference, at the Finding the Frame workshop, or at the Inductive Biases workshop🙂
In case you could not attend, feel free to check it out 👉
youtu.be/RCA22JWiiY8?...
In case you could not attend, feel free to check it out 👉
youtu.be/RCA22JWiiY8?...
I will be presenting the research I have been working on for the last 2 years with Carlo D'Eramo, @jan-peters.bsky.social, and many more collaborators!
I will be presenting the research I have been working on for the last 2 years with Carlo D'Eramo, @jan-peters.bsky.social, and many more collaborators!
I will be presenting Eau De Q-Network today @rldmdublin2025.bsky.social Feel free to come and exchange at Poster #28 🎤
bsky.app/profile/theo...
I will be presenting Eau De Q-Network today @rldmdublin2025.bsky.social Feel free to come and exchange at Poster #28 🎤
bsky.app/profile/theo...
🧵
#Robotics #TactileSensing #ReinforcementLearning #Transformers #ActivePerception @ias-tudarmstadt.bsky.social
🧵
#Robotics #TactileSensing #ReinforcementLearning #Transformers #ActivePerception @ias-tudarmstadt.bsky.social
🔍Eau De Q-Network gradually prunes the network weights at the agent's learning pace, ultimately reaching a final sparsity level that is discovered by the algorithm!🔎
👉📰 arxiv.org/pdf/2503.01437
🔍Eau De Q-Network gradually prunes the network weights at the agent's learning pace, ultimately reaching a final sparsity level that is discovered by the algorithm!🔎
👉📰 arxiv.org/pdf/2503.01437
Happy to be organizing this with @georgiachal.bsky.social, Yu Xiang, @danfei.bsky.social and @galasso.bsky.social!
Happy to be organizing this with @georgiachal.bsky.social, Yu Xiang, @danfei.bsky.social and @galasso.bsky.social!
i-QN learns several Bellman iterations in parallel instead of learning them sequentially via repeated target updates ✨ This directly translates to performance improvements on the Atari and MuJoCo benchmarks 🚀
Théo Vincent, Daniel Palenicek, Boris Belousov, Jan Peters, Carlo D'Eramo
Action editor: Pablo Castro
https://openreview.net/forum?id=Lt2H8Bd8jF
#reinforcement #iterative #iterations
i-QN learns several Bellman iterations in parallel instead of learning them sequentially via repeated target updates ✨ This directly translates to performance improvements on the Atari and MuJoCo benchmarks 🚀
Théo Vincent, Daniel Palenicek, Boris Belousov, Jan Peters, Carlo D'Eramo
Action editor: Pablo Castro
https://openreview.net/forum?id=Lt2H8Bd8jF
#reinforcement #iterative #iterations
Théo Vincent, Daniel Palenicek, Boris Belousov, Jan Peters, Carlo D'Eramo
Action editor: Pablo Castro
https://openreview.net/forum?id=Lt2H8Bd8jF
#reinforcement #iterative #iterations
go.bsky.app/3WPHcHg
go.bsky.app/3WPHcHg