Mike Dodds
m-dodds.bsky.social
Formal methods nitwit. https://mikedodds.github.io

AI / math / formal methods paper feed: @ai-fm-papers.bsky.social
That’s true, and I think that’s exactly why Claude does so well at proofs. It’s just that I happen to know first hand that proofs are a particularly difficult *kind* of program
September 21, 2025 at 1:12 AM
Hey :) Seems like a lot of people moved here from old Twitter and I’m still catching up
June 25, 2025 at 10:51 PM
If a tool is not popular, it’s uncompelling to argue that everyone is just mistaken. At some point you should ask why the tool isn’t useful (at the current cost/benefit point)
May 24, 2025 at 2:12 AM
I do think a lot of people are in denial though!
January 21, 2025 at 11:49 PM
I don’t think literally everyone should drop what they’re doing. But my sense is PL research as a whole is significantly under-reacting to AI. So I suppose I think *some more* PL people should bet on AI (but maybe not you!)
January 21, 2025 at 6:20 PM
Happy to mail you a couple. Email me, my address is on my website
January 21, 2025 at 7:15 AM
I think you’ve put your finger on the exact worldview mismatch because 5-10 years seems like an insanely long time horizon to me
January 21, 2025 at 5:07 AM
Why constrain the grammar - just pull more samples and keep the ones that pass :p
January 21, 2025 at 5:02 AM
8 years on, the future is here! xkcd.com/1813/ 🤮
December 27, 2024 at 1:42 AM
If I understand right, the private test set is only used during evaluation of the model - not available to the team doing the training
December 21, 2024 at 10:16 PM
Seems almost certain it’s deliberately trained on math reasoning. The way the o-series models seem to work is by long CoT, with reinforcement learning to impose correct reasoning. Not much public about how o3 works internally, but Chollet has some speculation: arcprize.org/blog/oai-o3-...
OpenAI o3 Breakthrough High Score on ARC-AGI-Pub
OpenAI o3 scores 75.7% on ARC-AGI public leaderboard.
December 21, 2024 at 5:44 PM
A big jump in coding skill as well:
December 21, 2024 at 5:19 PM