Sendhil Mullainathan
sendhil.bsky.social
Sendhil Mullainathan
@sendhil.bsky.social
Really excited about this paper. Part of our continued efforts to make sense of what ML models are and aren’t doing.
Can an AI model predict perfectly and still have a terrible world model?

What would that even mean?

Our new ICML paper (poster tomorrow!) formalizes these questions.

One result tells the story: A transformer trained on 10M solar systems nails planetary orbits. But it botches gravitational laws 🧵
July 14, 2025 at 1:53 PM
Reposted by Sendhil Mullainathan
Honest people don’t lie. Or do they? Liars aren’t honest. Or are they?
One puzzling conundrum in contemporary politics is that politicians who seem to be estranged from facts and evidence are nonetheless considered honest by their followers.
1/n
April 10, 2025 at 10:47 AM
Reposted by Sendhil Mullainathan
Last chance to apply to this year's Machine Learning in Economics Summer Institute, our AI/ML summer school for graduate students, co-organized with @sendhil.bsky.social @asheshrambachan.bsky.social @aleximas.bsky.social @lindseyraymond.bsky.social: www.chicagobooth.edu/research/cen... #econsky
Machine Learning in Economics Summer Institute 2025 (MLESI25)
<span id="docs-internal-guid-53f97f20-7fff-b55c-c89c-4f85d663a48c" style="color: #39393a;"><span style="color: #000000;">MLESI<span id="docs-internal-guid-394e21c4-7fff-1437-16ff-fdbb1ce1d34c" style="...
www.chicagobooth.edu
April 8, 2025 at 3:36 AM
Reposted by Sendhil Mullainathan
First was #SSAC day 2 w/sessions on basketball analytics (@deepakmalhotra.bsky.social, S Bird, E Wasch, M Zarren, D Morey), NFL front office (L Horton, @scottpioli51.bsky.social, K Demoff, @espnbillc.bsky.social), & AI (@sendhil.bsky.social, @pablo.show, Morey) www.youtube.com/watch?v=jG1A... (2/5)
SSAC25: Main Stage: Day 2
YouTube video by 42 Analytics
www.youtube.com
March 10, 2025 at 3:10 AM
Reposted by Sendhil Mullainathan
An incredible opportunity for the right person:

The Santa Fe Institute is seeking applications for full-time resident faculty positions at all academic levels

For me, this has been the best possible job ever.

More info here: santafe.edu/about/jobs/r...
sfiscience
SFI seeks applications for full-time, 12-month resident faculty positions at all academic levels.
santafe.edu
January 8, 2025 at 7:39 PM
Reposted by Sendhil Mullainathan
For those who didn't make it to #ASSA2025: strongly recommend @sendhil.bsky.social's AEA distinguished lecture, available at www.aeaweb.org/webcasts/202... (starting at minute 16)!
American Economic Association: AEA Excellence Awards and Distinguished Lecture
www.aeaweb.org
January 8, 2025 at 4:16 PM
Reposted by Sendhil Mullainathan
Some of my thoughts on OpenAI's o3 and the ARC-AGI benchmark

aiguide.substack.com/p/did-openai...
Did OpenAI Just Solve Abstract Reasoning?
OpenAI’s o3 model aces the "Abstraction and Reasoning Corpus" — but what does it mean?
aiguide.substack.com
December 23, 2024 at 2:38 PM
Reposted by Sendhil Mullainathan
Very interesting interview with @sendhil.bsky.social on AI. Fresh perspective on several topics, including value of AI in education. youtu.be/z_svj3NP968
How AI can truly change the world – Sendhil Mullainathan & David Yanagizawa-Drott
YouTube video by Economics. For Society.
youtu.be
December 16, 2024 at 2:36 AM
Reposted by Sendhil Mullainathan
What happens when @sendhil.bsky.social and David Yanagizawa-Drott come together to discuss AI? In this inaugural episode of 'Thought Supply' the conversation moves beyond the buzzwords to explore the profound economic implications.

Premiering Dec 13 8pm CET youtu.be/z_svj3NP968
#EconomicsForSociety
December 13, 2024 at 9:23 AM
Reposted by Sendhil Mullainathan
Recently accepted by #QJE, “Do Financial Concerns Make Workers Less Productive,” by Kaur, Mullainathan (@sendhil.bsky.social), Oh, and Schilbach (@fschilbach.bsky.social): doi.org/10.1093/qje/...
Do Financial Concerns Make Workers Less Productive?*
Abstract. Workers who are worried about their personal finances may find it hard to focus at work. If so, reducing financial concerns could by itself incre
doi.org
December 13, 2024 at 2:18 PM
Reposted by Sendhil Mullainathan
Our paper proposes new metrics for world model recovery based on the Myhill-Nerode theorem from language theory:

Co-authors: Justin Chen, Ashesh Rambachan, Jon Kleinberg, Sendhil Mullainathan (@sendhil.bsky.social)
December 12, 2024 at 6:59 PM
Reposted by Sendhil Mullainathan
🔖 Looking forward to digging into the latest working paper from Jens Ludwig, @sendhil.bsky.social and Ashesh Rambachan on the use of LLMs in economics research.

#econsky #scisky #phdsky

arxiv.org/abs/2412.070...
Large Language Models: An Applied Econometric Framework
Large language models (LLMs) are being used in economics research to form predictions, label text, simulate human responses, generate hypotheses, and even produce data for times and places where such ...
arxiv.org
December 12, 2024 at 7:56 PM
Reposted by Sendhil Mullainathan
Economic valuations fluctuate in ways empirical research cannot fully explain

What information are we missing? Economic theories emphasize the role of hard-to-quantify beliefs and perceptions

My job market paper develops algorithms + measurement to quantify perceptions of firms
November 20, 2024 at 8:16 PM
Reposted by Sendhil Mullainathan
August 24, 2024 at 11:12 AM
Reposted by Sendhil Mullainathan
To researchers doing LLM evaluation: prompting is *not a substitute* for direct probability measurements. Check out the camera-ready version of our work, to appear at EMNLP 2023! (w/ @rplevy.bsky.social)

Paper: arxiv.org/abs/2305.13264

Original thread: twitter.com/_jennhu/stat...
October 24, 2023 at 3:03 PM