banner
dillonlai.bsky.social
@dillonlai.bsky.social
This presentation, The Physics of Language Models, by Zeyuan Allen-Zhu, changed my perspective on LLMs. With a lot of recent research such as the GSM-Symbolic paper by Apple, it’s generally understood that LLMs memorize or find shortcut heuristics to solve problems.
December 10, 2024 at 4:13 PM