🔬research: deep generative learning; agentic systems; synthetic data
PhD @EPFL on reliable magic
Spent time @MSR, @Google
machine learning & company building
🎓@NYU @UvA alumn
Make sure to check it out to learn why training with PPO for too long makes your agent collapse!
Jiaheng Hu of UTexas on Unsupervised Skill Discovery for HRL
@skandermoalla.bsky.social of EPFL: Representation and Trust in PPO
Adil Zouitine of IRT Saint Exupery/Hugging Face : Time-Constrained Robust MDPs
Make sure to check it out to learn why training with PPO for too long makes your agent collapse!
Check out SCION: a new optimizer that adapts to the geometry of your problem using norm-constrained linear minimization oracles (LMOs): 🧵👇
Check out SCION: a new optimizer that adapts to the geometry of your problem using norm-constrained linear minimization oracles (LMOs): 🧵👇
(Did I mention we are hiring on the Generative Media team, btw 👀)
blog.google/technology/g...
(Did I mention we are hiring on the Generative Media team, btw 👀)
blog.google/technology/g...
@caglarai.bsky.social
🧑💻 github.com/CLAIRE-Labo/...
@caglarai.bsky.social
🧑💻 github.com/CLAIRE-Labo/...
We have two accepted papers from my lab:
1. Building on Efficient Foundations: Effective Training of LLMs with Structured Feedforward Layers, on Wednesday, East Exhibit Hall A-C #2010 (1/3)
We have two accepted papers from my lab:
1. Building on Efficient Foundations: Effective Training of LLMs with Structured Feedforward Layers, on Wednesday, East Exhibit Hall A-C #2010 (1/3)
I'm a bit biased, but this is a fantastic conference for #probabilisticML, #causality, #causalML, #tractable #probabilistic #models, #imprecise probabilities,
#reasoning, #neurosymbolic approaches and more.
#causalSky #statSky
Hello, 🦋!
Follow us to reduce uncertainty!
I missed this when it came out, but I love papers like this: a simple change to an already powerful technique, that significantly improves results without introducing complexity or hyperparameters.
I missed this when it came out, but I love papers like this: a simple change to an already powerful technique, that significantly improves results without introducing complexity or hyperparameters.
#EPFL #academicsky
go.bsky.app/73zdbtp
#EPFL #academicsky
go.bsky.app/73zdbtp