1/ 🔥Ever experinced softmax attention fade as sequences grow?
That blur is why many attention mechanisms stumble on algorithmic and reasoning tasks. Well, we have a Algebraic Geometric Tropical solution 🌴
The paper: arxiv.org/abs/2505.17190
The code: github.com/Baran-phys/T...
The paper: arxiv.org/abs/2505.17190
The code: github.com/Baran-phys/T...
- Let’s use LLM to do a baby science/math, after it doesn’t work, headline: LLM is bad at the baby math task —> guaranteed virality 😒
- Meanwhile, you develope a novel (non-LLM) method to solve this issue, report success on a deep math problem
—> naa, not enough drama🤦🏻
- Let’s use LLM to do a baby science/math, after it doesn’t work, headline: LLM is bad at the baby math task —> guaranteed virality 😒
- Meanwhile, you develope a novel (non-LLM) method to solve this issue, report success on a deep math problem
—> naa, not enough drama🤦🏻
Check out our paper: arxiv.org/abs/2505.17190
Our code: github.com/Baran-phys/T...
Check out our paper: arxiv.org/abs/2505.17190
Our code: github.com/Baran-phys/T...
1/ 🔥Ever experinced softmax attention fade as sequences grow?
That blur is why many attention mechanisms stumble on algorithmic and reasoning tasks. Well, we have a Algebraic Geometric Tropical solution 🌴
1/ 🔥Ever experinced softmax attention fade as sequences grow?
That blur is why many attention mechanisms stumble on algorithmic and reasoning tasks. Well, we have a Algebraic Geometric Tropical solution 🌴
openreview.net/forum?id=4X9...
openreview.net/forum?id=4X9...
🫵 Calling all pioneers in AI4Math:
📜 Submit your exciting work:
sites.google.com/view/ai4math...
🫵 Calling all pioneers in AI4Math:
📜 Submit your exciting work:
sites.google.com/view/ai4math...
arxiv.org/abs/2503.06712
arxiv.org/abs/2503.06712
I'll be presenting our results, openreview.net/forum?id=4X9..., at the Math4AI/AI4Math Workshop @mpiMathSci! 🔥
📅 Registration is open until Feb 28
🔗 www.mis.mpg.de/events/serie...
#AI4Math
I'll be presenting our results, openreview.net/forum?id=4X9..., at the Math4AI/AI4Math Workshop @mpiMathSci! 🔥
📅 Registration is open until Feb 28
🔗 www.mis.mpg.de/events/serie...
#AI4Math
Can Transformers Do Enumerative Geometry? (arxiv.org/abs/2408.14915) has been accepted to the
@iclr-conf.bsky.social!!
Congrats to my collaborators Alessandro Giacchetto at ETH Züruch and Roderic G. Corominas at Harvard.
#ICLR2025 #AI4Math #ORIGINS
Can Transformers Do Enumerative Geometry? (arxiv.org/abs/2408.14915) has been accepted to the
@iclr-conf.bsky.social!!
Congrats to my collaborators Alessandro Giacchetto at ETH Züruch and Roderic G. Corominas at Harvard.
#ICLR2025 #AI4Math #ORIGINS