Jannis Born
banner
jannisblrn.bsky.social
Jannis Born
@jannisblrn.bsky.social
Research Scientist @IBM - AI for Scientific Discovery! Tech & sports enthusiast
#ICML Why are LLMs so powerful but still suck at math? 🤔 A key problem is cross-entropy loss: It is nominal-scale, so tokens are unordered. That makes sense for words, but not for numbers. For a "5" label, predicting “6” or “9” gives the same loss 😱 Yes, it's crazy! No, nobody has fixed this yet! ⬇️
July 3, 2025 at 9:21 PM