Visit my research blog at https://alexshtf.github.io
I like things on the intersection between opimization, numerical analysis, ML, and software engineering. I have a blog, where I write about stuff I like:
alexshtf.github.io
You're welcome to add me to starter packs as you see fit.
11011110.github.io/blog/2025/05... via @theory.report
11011110.github.io/blog/2025/05... via @theory.report
@beenwrekt.bsky.social recently started a bit of noise with his post about nonexistence of overfitting, but he has a point. In this post we explore it using simple polynomial curve fitting, *without regularization*, using another interesting basis.
alexshtf.github.io/2025/03/27/F...
@beenwrekt.bsky.social recently started a bit of noise with his post about nonexistence of overfitting, but he has a point. In this post we explore it using simple polynomial curve fitting, *without regularization*, using another interesting basis.
alexshtf.github.io/2025/03/27/F...
Steven Jecmen, Nihar B Shah, Fei Fang, Leman Akoglu
Action editor: Laurent Charlin
https://openreview.net/forum?id=o58uy91V2V
#collusion #colluders #fraud
Steven Jecmen, Nihar B Shah, Fei Fang, Leman Akoglu
Action editor: Laurent Charlin
https://openreview.net/forum?id=o58uy91V2V
#collusion #colluders #fraud
Alex Shtoff, Elie Abboud, Rotem Stram, Oren Somekh
Action editor: Andriy Mnih
https://openreview.net/forum?id=M4222IBHsh
#factorization #feature #features
Alex Shtoff, Elie Abboud, Rotem Stram, Oren Somekh
Action editor: Andriy Mnih
https://openreview.net/forum?id=M4222IBHsh
#factorization #feature #features
- Claire Mathieu : www.youtube.com/watch?v=_ZXZ...
- Gabriel Peyré : www.youtube.com/watch?v=vQOF...
- Jérôme Bolte : www.youtube.com/watch?v=tjkg...
- Axel Parmentier : www.youtube.com/watch?v=DohO...
Enjoy 🙂
- Claire Mathieu : www.youtube.com/watch?v=_ZXZ...
- Gabriel Peyré : www.youtube.com/watch?v=vQOF...
- Jérôme Bolte : www.youtube.com/watch?v=tjkg...
- Axel Parmentier : www.youtube.com/watch?v=DohO...
Enjoy 🙂
Generative AI is a Parasitic Cancer
www.youtube.com/watch?v=-opB...
xᵢ₊₁=(1 - α)xᵢ+αyᵢ
One of the most overlooked facts is that it is actually online gradient descent!
xᵢ₊₁=(1 - α)xᵢ+αyᵢ
One of the most overlooked facts is that it is actually online gradient descent!
This post is about our recent TMLR paper, "Function Basis Encoding of Numerical Features in Factorization Machines", by Alex Shtoff, Elie Abboud, Rotem Stram, and Oren Somekh.
Paper: openreview.net/forum?id=M42...
Code: github.com/alexshtf/con...
This post is about our recent TMLR paper, "Function Basis Encoding of Numerical Features in Factorization Machines", by Alex Shtoff, Elie Abboud, Rotem Stram, and Oren Somekh.
Paper: openreview.net/forum?id=M42...
Code: github.com/alexshtf/con...
Jirayu Burapacheep, Yixuan Li
Action editor: Changjian Shui
https://openreview.net/forum?id=FmA1JPWBM8
#classifiers #classifier #classification
Jirayu Burapacheep, Yixuan Li
Action editor: Changjian Shui
https://openreview.net/forum?id=FmA1JPWBM8
#classifiers #classifier #classification
h(x) = p(A x + b), with A Aᵀ = α I
It is [1]:
proxₜₕ(x) = x + t⁻¹ Aᵀ(proxₜₚ(A x) - A x)
[1] Combettes, Wajs. Signal Recovery by Proximal Forward-Backward Splitting.
h(x) = p(A x + b), with A Aᵀ = α I
It is [1]:
proxₜₕ(x) = x + t⁻¹ Aᵀ(proxₜₚ(A x) - A x)
[1] Combettes, Wajs. Signal Recovery by Proximal Forward-Backward Splitting.
for fans of: mean estimation, online learning with log loss, optimal portfolios, hypothesis testing with E-values, etc.
dig in:
arxiv.org/abs/2412.02640
for fans of: mean estimation, online learning with log loss, optimal portfolios, hypothesis testing with E-values, etc.
dig in:
arxiv.org/abs/2412.02640
How do you call a model for estimating the conditional mean?
How do you call a model for estimating the conditional mean?
This has had incredible impact, most notably on how ML reviews are conducted
This has had incredible impact, most notably on how ML reviews are conducted
probnum25.github.io
probnum25.github.io