working on providing reliable and verifiable ai mechanisms
#RL & formal methods
delgrange.me
we've had a few works proposing techniques for enabling scaling in deep rl, such as MoEs, tokenization, & sparse training.
ghada sokar and i looked further & found a bit more clarity into *what* enables scaling, leading us to simpler solutions (see GAP in figure)!
1/
we've had a few works proposing techniques for enabling scaling in deep rl, such as MoEs, tokenization, & sparse training.
ghada sokar and i looked further & found a bit more clarity into *what* enables scaling, leading us to simpler solutions (see GAP in figure)!
1/
Join our interdisciplinary research project on causal agent-based modelling!
🔍 Looking for curious minds with a MSc degree (or near to completing one) in CS/AI/related fields.
📍 Location: Utrecht University, NL
🗓️ Deadline: 16 June 2025
📩 Info: www.uu.nl/en/organisat...
Join our interdisciplinary research project on causal agent-based modelling!
🔍 Looking for curious minds with a MSc degree (or near to completing one) in CS/AI/related fields.
📍 Location: Utrecht University, NL
🗓️ Deadline: 16 June 2025
📩 Info: www.uu.nl/en/organisat...
We combine reinforcement learning 🤖🧠 & reactive synthesis ⚙️ for learning scalable safe policies in complex tasks with formal guarantees.
📑paper: arxiv.org/abs/2402.13785
✍️blogpost: delgrange.me/post/composi...
A thread🧵⤵️
We combine reinforcement learning 🤖🧠 & reactive synthesis ⚙️ for learning scalable safe policies in complex tasks with formal guarantees.
📑paper: arxiv.org/abs/2402.13785
✍️blogpost: delgrange.me/post/composi...
A thread🧵⤵️
-Do I need multiple training runs?
-How do I report model confidence?
-And a great section on common mistakes to fend off reviewer 2
🧪
#DRL
#reinforcementlearning
#AI
arxiv.org/abs/2304.01315
-Do I need multiple training runs?
-How do I report model confidence?
-And a great section on common mistakes to fend off reviewer 2
🧪
#DRL
#reinforcementlearning
#AI
arxiv.org/abs/2304.01315
go.bsky.app/LWyGAAu
go.bsky.app/LWyGAAu
go.bsky.app/3WPHcHg
go.bsky.app/3WPHcHg