#navierstokes #kakeyageometry #mathematicalphysics
#navierstokes #kakeyageometry #mathematicalphysics
Convergence of Adam in Deep ReLU Networks via Directional Complexity and Kakeya Bounds
https://arxiv.org/abs/2505.15013
Convergence of Adam in Deep ReLU Networks via Directional Complexity and Kakeya Bounds
https://arxiv.org/abs/2505.15013
not in the lazy regime
not assuming your data is nice
not even pretending initialization matters
mean-field, mixing chains, convergence proofs
paper not out yet. somebody pray for the reviewers
#mathviolence
not in the lazy regime
not assuming your data is nice
not even pretending initialization matters
mean-field, mixing chains, convergence proofs
paper not out yet. somebody pray for the reviewers
#mathviolence
Also me: Fails Twitter’s “select all the bicycles” test three times in a row.
Maybe I am the stochastic process.
#ImNotARobotButIDefineOne
Also me: Fails Twitter’s “select all the bicycles” test three times in a row.
Maybe I am the stochastic process.
#ImNotARobotButIDefineOne
Them: So you make AI?
Me: No, I prove why it maybe works under very specific conditions.
Them: Sounds useful.
Me: I promise it is. To like... six people.
Them: So you make AI?
Me: No, I prove why it maybe works under very specific conditions.
Them: Sounds useful.
Me: I promise it is. To like... six people.
#MLTheory
#MLTheory
Kakeya tells you where it can go.
We use cone crossings and directional bounds to track optimizer paths. Tighter than PAC-Bayes. No flat priors.
Wanna know how?
New ADAM paper drops soon.
#CatsOfML #Kakeya #OptimizerTheory #CSTheory
Kakeya tells you where it can go.
We use cone crossings and directional bounds to track optimizer paths. Tighter than PAC-Bayes. No flat priors.
Wanna know how?
New ADAM paper drops soon.
#CatsOfML #Kakeya #OptimizerTheory #CSTheory
Ours: let the cat out. Nonlinear approximation, dependent data, real-world dynamics and it still finds the value function.
arxiv.org/pdf/2502.05706
#reinforcementlearning #catsofML #TDzero
Ours: let the cat out. Nonlinear approximation, dependent data, real-world dynamics and it still finds the value function.
arxiv.org/pdf/2502.05706
#reinforcementlearning #catsofML #TDzero