Kaelan Donatella
kaelandonatella.bsky.social
Kaelan Donatella
@kaelandonatella.bsky.social
hardware/software to make ai systems faster and more reliable. i like clouds and french cinema
Nice to see some work trying to disentangle concepts :)
How do LLMs learn to reason from data? Are they ~retrieving the answers from parametric knowledge🦜? In our new preprint, we look at the pretraining data and find evidence against this:

Procedural knowledge in pretraining drives LLM reasoning ⚙️🔢

🧵⬇️
November 20, 2024 at 6:46 PM
shape rotators unhappy
November 14, 2024 at 10:18 AM
I like it here but there are way too many posts about bsky itself
November 14, 2024 at 10:17 AM
if we can get the same type of paper discussion content without the ai influencers here that would be so nice
One thing still missing here is good discussion of academic papers, so I guess I'll be the change I want to see in the world

Really interesting results from Jacob Andreas's group showing great performance on ARC-AGI just from doing a few gradient descent steps at test time arxiv.org/abs/2411.07279
The Surprising Effectiveness of Test-Time Training for Abstract Reasoning
Language models have shown impressive performance on tasks within their training distribution, but often struggle with novel problems requiring complex reasoning. We investigate the effectiveness of t...
arxiv.org
November 13, 2024 at 4:30 PM