Matt Beane
banner
mattbeane.bsky.social
Matt Beane
@mattbeane.bsky.social
Studying work involving intelligent machines, especially robots. @MITSloan PhD, @Ucsb Asst Prof, @Stanford and @MIT Digital Fellow, @Tedtalks @Thinkers50
Reposted by Matt Beane
This includes many of my papers, too. The point I am making is the findings in careful academic research likely represents a lower bound of AI capabilities at this point.
May 15, 2025 at 10:16 PM
Reposted by Matt Beane
I can’t

i just …

i can’t

www.404media.co/anthropic-cl...
February 4, 2025 at 1:30 PM
Reposted by Matt Beane
Hi Everyone!

We're hosting our Wharton AI and the Future of Work Conference on 5/21-22. Last year was a great event with some of the top papers on AI and work.

Paper submission deadline is 3/3. Come join us! Submit papers here: forms.gle/ozJ5xEaktXDE...
forms.gle
January 29, 2025 at 6:46 PM
Exciting new hobby project in the offing related to AI and skill. Involves a childhood passion, a wild leap into the unknown, made real via an order from Amazon just now. Will be 100% cool, I will be documenting things, sharing eventually. Feels like April 2023 again!
January 15, 2025 at 5:07 AM
The Silo is so good. Just superb. This generation's answer to the BSG remake.
January 13, 2025 at 1:44 AM
Reposted by Matt Beane
My hobby horse. You can simulate a rocket all you want, and use more energy on computation than the actual rocket would, but you won't get to orbit until you ignite rocket fuel. What if all the energy we are spending on simulating learning is not the juice we really need to make intelligence?
January 9, 2025 at 8:49 AM
Reposted by Matt Beane
Here's my end-of-year review of things we learned out about LLMs in 2024 - we learned a LOT of things simonwillison.net/2024/Dec/31/...

Table of contents:
December 31, 2024 at 6:10 PM
Reposted by Matt Beane
In 2024 we learned a lot about how AI is impacting work. People report that they're saving 30 minutes a day using AI (aka.ms/nfw2024), and randomized controlled trials reveal they’re creating 10% more documents, reading 11% fewer e-mails, and spending 4% less time on e-mail (aka.ms/productivity...).
December 31, 2024 at 7:39 PM
Reposted by Matt Beane
Independent evaluations of OpenAI’s o3 suggest that it passed math & reasoning benchmarks that were previously considered far out of reach for AI including achieving a score on ARC-AGI that was associated with actually achieving AGI (though the creators of the benchmark don’t think it o3 is AGI)
December 20, 2024 at 6:26 PM
Reposted by Matt Beane
Join me by the fireside this Friday with Matt Beane as we dive into one of today’s biggest workforce challenges: upskilling at scale. 📈

Linke below to hear the full discussion on Friday, December 13 at 11 am EST!

linktr.ee/RitaMcGrath

@mattbeane.bsky.social
December 9, 2024 at 6:45 PM
I propose a workshop.

Most engineers/CS working on AI presume away well established, profound brakes on AI diffusion.

Most social scientists presume away how AI use could reshape those brakes.

Let's gather these groups, examine these brakes 1-by-1, make grounded predictions.
December 7, 2024 at 7:12 PM
Reposted by Matt Beane
Models like o1 suggest that people won’t generally notice AGI-ish systems that are better than humans at most intellectual tasks, but which are not autonomous or self-directed

Most folks don’t regularly have a lot of tasks that bump up against the limits of human intelligence, so won’t see it
December 7, 2024 at 12:49 AM
Grateful for the opportunity to visit and learn from the professionals at the L&DI conference. And very glad to hear you found my talk so valuable, Garth! Means a lot.
In Dublin for the National Learning & Development Conference.

Some insightful opening remarks, followed by an absolutely stonking keynote by @mattbeane.bsky.social. Crystallised a lot of my worries around preserving expertise in software engineering during the age of GenAI. I have reading to do.
December 4, 2024 at 2:02 PM
Reposted by Matt Beane
I made an HRI Starter Pack!

If you are a Human-Robot Interaction or Social Robotics researcher and I missed you while scrolling through bsky's suggestions, just ping me and I'll add ya.

go.bsky.app/CsnNn3s
December 3, 2024 at 6:37 PM
David Meyer (v.) /ˈdeɪvɪd ˈmaɪ.ər/

To attribute complex, intentional design or deeper meaning to simple emergent behaviors of large language models, especially when such behaviors are more likely explained by straightforward technical constraints or training artifacts.
December 3, 2024 at 10:53 AM
Reposted by Matt Beane
My Thanksgiving post. A Kurt Vonnegut poem. He talks with Joe Heller (Catch 22 fame) about a billionaire. Key part:

Joe said, "I've got something he can never have"

And I said, "What on earth could that be, Joe?"

And Joe said, "The knowledge that I've got enough"

www.linkedin.com/pulse/kurt-v...
Kurt Vonnegut, Joe Heller, and How to Think Like a Mensch
This story remains my favorite Thanksgiving message; it reminds me to be grateful for what I have and of the evils of jealousy and destructive competition. I first posted it on my work matters blog mo...
www.linkedin.com
November 27, 2024 at 7:40 PM
Oh my dear god this is an incredible study.
Amazing paper (link next slide) by group incl. 2 Congolese researchers in Kinshasa looks at "official" corruption. Not just rogue officers but official policy to extort drivers = 80% of police revenue. Crazy shit: they worked w/ folks IN THE POLICE to secretly monitor bribes AND VARY bribe quotas!
November 27, 2024 at 7:04 PM
Not every day your work gets a healthy mention in the Sunday @nytimes.com!

The software talent market went into freefall in July of 2022.
Sarah Kessler takes us inside the maelstrom by investigating the impact on graduates of coding bootcamps. Great read.

www.nytimes.com/2024/11/24/b...
Do Coding Boot Camps Make Sense in an A.I. World?
Coding boot camps once looked like the golden ticket to an economically secure future. But as that promise fades, what should you do? Keep learning, until further notice.
www.nytimes.com
November 24, 2024 at 2:13 PM
Reposted by Matt Beane
Easy to get the wrong impression around here, but when you actually survey students, teachers, and parents they love AI.

In the survey, it is people who never used it who don’t like it. www.waltonfamilyfoundation.org/learning/the...
November 23, 2024 at 12:32 AM
Head to head twitter/bluesky social science test:

What are your go-to, recent empirical papers on surveillance and technology?
November 19, 2024 at 6:55 PM
Reposted by Matt Beane
Culture is definitely gonna play a part, but architecture is going to be key to creating a positive social media environment

Bluesky has given us so many tools like this to cut out all the crap that made even pre-Elon Twitter so toxic
Fun fact: If someone quotes a post of yours in a way that is unwelcome (as is commonplace on Twitter/X) there is a tool to combat the unwanted attention. Simply click the three dot menu on the quote post & click "Detach quote". This removes your post from their quote post. Useful to know, do share!
November 19, 2024 at 2:59 AM
When people ask me which model to use for writing, I say Claude. Have for months.

This is now my go-to example for explaining why.

To get why Claude is better, first read and get why Claude is better. To break this recursion, just compare their explanations of recursion - you'll see.
“Claude and ChatGPT, explain recursion in a clever way that is recursive.”
November 18, 2024 at 3:04 PM
Reposted by Matt Beane
I made a list of some of my favorite robotics people to follow, if any of former RoboticsTwitter is still looking for folks over here bsky.app/profile/did:...
November 17, 2024 at 1:51 PM
What a lovely gift on a Friday: using LLMs (admittedly, by playing to stereotypes) to confound scammers through a convincing "granny" avatar that will chat their time away. Vid worth watching for a laugh.

Not all disinformation harms the consumer!

news.virginmediao2.co.uk/o2-unveils-d...
O2 unveils Daisy, the AI granny wasting scammers’ time - Virgin Media O2
O2 has today unveiled the newest member of its fraud prevention team, 'Daisy'. As ‘Head of Scammer Relations’, this state-of-the-art AI Granny's mission is to talk with fraudsters and waste as much of...
news.virginmediao2.co.uk
November 15, 2024 at 11:07 PM
Reposted by Matt Beane
!!!

US power grid added battery equivalent of 20 nuclear reactors in past four years

www.theguardian.com/environment/...
November 10, 2024 at 4:19 PM