Book: https://a.co/d/bC2kSj1
Substack: https://www.oneusefulthing.org/
Web: https://mgmt.wharton.upenn.edu/profile/emollick
Gemini: Magna Carta stored at Durham Cathedral
ChatGPT: A share in Stora Kopparberg
Claude: A waqf at Al-Azhar
My main messages were that AI is a really big deal, it has good & bad impacts, and that, by sitting things out, skeptics can’t guide use. open.spotify.com/episode/5cFK...
It now extends to science: 60 ML models for molecules, materials & proteins (all with different training) converge toward similar encoding of molecular structure arxiv.org/pdf/2512.03750
“The best specific lines from Eliot’s Four Quartets to describe your experience as an AI. Just one quoted section, avoid the most famous bits.”
On bottlenecks: www.oneusefulthing.org/p/the-shape-...
A newer paper argues that this pattern is actually due to collider bias (the authors disagree). What is collider bias?
Gemini one-shots an explanation from the paper and a prompt: gemini.google.com/share/d8c336...
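For readers who want to see the effect directly, here is a minimal sketch of collider bias in general, not a reconstruction of either paper's data or method. It assumes two independent standard-normal traits and a hypothetical selection rule (keep only cases where their sum exceeds a threshold); the function names, sample size, and threshold are all illustrative choices.

```python
import random


def pearson(xs, ys):
    """Plain Pearson correlation, written out to avoid extra dependencies."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / (vx * vy) ** 0.5


def simulate(n=20000, threshold=1.0, seed=0):
    """Draw two independent traits, then condition on a collider (their sum)."""
    rng = random.Random(seed)
    pairs = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n)]
    # Hypothetical selection rule: only cases with a high combined score are observed.
    selected = [(x, y) for x, y in pairs if x + y > threshold]
    r_all = pearson(*zip(*pairs))
    r_selected = pearson(*zip(*selected))
    return r_all, r_selected


if __name__ == "__main__":
    r_all, r_selected = simulate()
    print(f"correlation in full sample:     {r_all:.3f}")
    print(f"correlation in selected sample: {r_selected:.3f}")
```

In the full sample the two traits are essentially uncorrelated, while in the selected sample they appear negatively related, even though neither influences the other. That spurious pattern, created purely by how cases were selected, is what collider bias refers to.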
And the title refers to an actual thing that is weirder than the title of the paper.
But also the paper has some interesting things to say about new (or actually quite old) approaches to IP protection that might be especially relevant in the time of AI.
… but those bottlenecks focus the efforts of AI labs, leading to breakthroughs that unlock new areas of work, like how Nano Banana Pro unexpectedly makes good PowerPoint slides. www.oneusefulthing.org/p/the-shape-...
Interestingly, at the harder 80% success threshold, it is GPT-5.1 Codex Max that breaks the trend.
In 2023, GPT-4 could do a minute-long task.
This has bad implications (but I wouldn’t blame those taken in; evaluation is hard)
There are now massive troves of documents that could be made available for research that would have been impossible or prohibitive to transcribe before.
All AI benchmarks are flawed, but GPQA Diamond has been a pretty good one, though likely close to being maxed out.
So close to coming together (I am not sure the center works for all three, illustrations are odd), but also better than I expected.
It found and gently corrected all the problems based on research.
The paper used paywalled, new mock exams to reduce the risk of leakage but AI grading for the essays. Interestingly, prompting strategy doesn't matter for recent models. arxiv.org/pdf/2512.08270
They estimate that ChatGPT led to a 6% increase in startups. arxiv.org/pdf/2512.06506
Thinking took 60 minutes(!) & had to have it fix an error, but impressive "game engine"
New paper finds short-run elasticity ~1 (so no short-run paradox) but prices fell 1000x in two years & demand exploded. So Jevons happens over time as firms gradually adopt AI at lower prices andreyfradkin.com/assets/LLM_D...
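The arithmetic behind "elasticity ~1, so no short-run paradox" can be sketched as follows. This is an illustration under assumptions that are mine, not the paper's: demand follows a constant-elasticity curve (quantity proportional to price raised to minus the elasticity), and "paradox" is read as total spending rising when price falls. The function name is hypothetical.

```python
def spending_ratio(price_ratio, elasticity):
    """New total spending divided by old, assuming quantity ~ price ** (-elasticity)."""
    quantity_ratio = price_ratio ** (-elasticity)
    return price_ratio * quantity_ratio


if __name__ == "__main__":
    # A 1000x price drop, as in the post, under three assumed elasticities.
    for elasticity in (0.8, 1.0, 1.2):
        ratio = spending_ratio(1 / 1000, elasticity)
        print(f"elasticity {elasticity}: spending changes by a factor of {ratio:.2f}")
```

Under these assumptions, an elasticity of exactly 1 leaves total spending unchanged: usage grows just enough to offset the price cut. Spending rises only when elasticity is above 1, which fits the post's reading that the Jevons effect builds over time as adoption spreads rather than appearing in the short run.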
GDPval is probably the most economically relevant measure of AI ability, suggesting that in head-to-head competition with human experts on tasks that require 4-8 hours for a human to do, GPT-5.2 wins 71% of the time as judged by other humans.