Lightnews — Scholar-powered news

Reposted by Matt Beane

Ethan Mollick

@emollick.bsky.social

This includes many of my papers, too. The point I am making is the findings in careful academic research likely represents a lower bound of AI capabilities at this point.

May 15, 2025 at 10:16 PM

Reposted by Matt Beane

Clive Thompson

@clivethompson.bsky.social

I can’t

i just …

i can’t

www.404media.co/anthropic-cl...

February 4, 2025 at 1:30 PM

Reposted by Matt Beane

Daniel Rock

@danielrock.bsky.social

Hi Everyone!

We're hosting our Wharton AI and the Future of Work Conference on 5/21-22. Last year was a great event with some of the top papers on AI and work.

Paper submission deadline is 3/3. Come join us! Submit papers here: forms.gle/ozJ5xEaktXDE...

forms.gle

January 29, 2025 at 6:46 PM

Matt Beane

@mattbeane.bsky.social

Exciting new hobby project in the offing related to AI and skill. Involves a childhood passion, a wild leap into the unknown, made real via an order from Amazon just now. Will be 100% cool, I will be documenting things, sharing eventually. Feels like April 2023 again!

January 15, 2025 at 5:07 AM

Matt Beane

@mattbeane.bsky.social

The Silo is so good. Just superb. This generation's answer to the BSG remake.

January 13, 2025 at 1:44 AM

Reposted by Matt Beane

Rodney Brooks

@rodneyabrooks.bsky.social

My hobby horse. You can simulate a rocket all you want, and use more energy on computation than the actual rocket would, but you won't get to orbit until you ignite rocket fuel. What if all the energy we are spending on simulating learning is not the juice we really need to make intelligence?

January 9, 2025 at 8:49 AM

Reposted by Matt Beane

Simon Willison

@simonwillison.net

Here's my end-of-year review of things we learned out about LLMs in 2024 - we learned a LOT of things simonwillison.net/2024/Dec/31/...

Table of contents:

The GPT-4 barrier was comprehensively broken
Some of those GPT-4 models run on my laptop
LLM prices crashed, thanks to competition and increased efficiency
Multimodal vision is common, audio and video are starting to emerge
Voice and live camera mode are science fiction come to life
Prompt driven app generation is a commodity already
Universal access to the best models lasted for just a few short months
“Agents” still haven’t really happened yet
Evals really matter
Apple Intelligence is bad, Apple’s MLX library is excellent
The rise of inference-scaling “reasoning” models
Was the best currently available LLM trained in China for less than $6m?
The environmental impact got better
The environmental impact got much, much worse
The year of slop
Synthetic training data works great
LLMs somehow got even harder to use
Knowledge is incredibly unevenly distributed
LLMs need better criticism
Everything tagged “llms” on my blog in 2024

December 31, 2024 at 6:10 PM

Reposted by Matt Beane

Jaime Teevan

@teevan.bsky.social

In 2024 we learned a lot about how AI is impacting work. People report that they're saving 30 minutes a day using AI (aka.ms/nfw2024), and randomized controlled trials reveal they’re creating 10% more documents, reading 11% fewer e-mails, and spending 4% less time on e-mail (aka.ms/productivity...).

December 31, 2024 at 7:39 PM

Reposted by Matt Beane

Ethan Mollick

@emollick.bsky.social

Independent evaluations of OpenAI’s o3 suggest that it passed math & reasoning benchmarks that were previously considered far out of reach for AI including achieving a score on ARC-AGI that was associated with actually achieving AGI (though the creators of the benchmark don’t think it o3 is AGI)

December 20, 2024 at 6:26 PM

Reposted by Matt Beane

Rita McGrath

@rgmcgrath.bsky.social

Join me by the fireside this Friday with Matt Beane as we dive into one of today’s biggest workforce challenges: upskilling at scale. 📈

Linke below to hear the full discussion on Friday, December 13 at 11 am EST!

linktr.ee/RitaMcGrath

@mattbeane.bsky.social

December 9, 2024 at 6:45 PM

Matt Beane

@mattbeane.bsky.social

I propose a workshop.

Most engineers/CS working on AI presume away well established, profound brakes on AI diffusion.

Most social scientists presume away how AI use could reshape those brakes.

Let's gather these groups, examine these brakes 1-by-1, make grounded predictions.

December 7, 2024 at 7:12 PM

Reposted by Matt Beane

Ethan Mollick

@emollick.bsky.social

Models like o1 suggest that people won’t generally notice AGI-ish systems that are better than humans at most intellectual tasks, but which are not autonomous or self-directed

Most folks don’t regularly have a lot of tasks that bump up against the limits of human intelligence, so won’t see it

December 7, 2024 at 12:49 AM

Matt Beane

@mattbeane.bsky.social

Grateful for the opportunity to visit and learn from the professionals at the L&DI conference. And very glad to hear you found my talk so valuable, Garth! Means a lot.

Garth Gilmour @garthgilmour.bsky.social · Dec 4

In Dublin for the National Learning & Development Conference.

Some insightful opening remarks, followed by an absolutely stonking keynote by @mattbeane.bsky.social. Crystallised a lot of my worries around preserving expertise in software engineering during the age of GenAI. I have reading to do.

A speaker at the National Learning & Development Conference.

December 4, 2024 at 2:02 PM

Reposted by Matt Beane

Tom Williams

@tomwilliams.phd

I made an HRI Starter Pack!

If you are a Human-Robot Interaction or Social Robotics researcher and I missed you while scrolling through bsky's suggestions, just ping me and I'll add ya.

go.bsky.app/CsnNn3s

December 3, 2024 at 6:37 PM

Matt Beane

@mattbeane.bsky.social

David Meyer (v.) /ˈdeɪvɪd ˈmaɪ.ər/

To attribute complex, intentional design or deeper meaning to simple emergent behaviors of large language models, especially when such behaviors are more likely explained by straightforward technical constraints or training artifacts.

December 3, 2024 at 10:53 AM

Reposted by Matt Beane

Bob Sutton

@bobsutton.net

My Thanksgiving post. A Kurt Vonnegut poem. He talks with Joe Heller (Catch 22 fame) about a billionaire. Key part:

Joe said, "I've got something he can never have"

And I said, "What on earth could that be, Joe?"

And Joe said, "The knowledge that I've got enough"

www.linkedin.com/pulse/kurt-v...

Kurt Vonnegut, Joe Heller, and How to Think Like a Mensch

This story remains my favorite Thanksgiving message; it reminds me to be grateful for what I have and of the evils of jealousy and destructive competition. I first posted it on my work matters blog mo...

www.linkedin.com

November 27, 2024 at 7:40 PM

Matt Beane

@mattbeane.bsky.social

Oh my dear god this is an incredible study.

Kevin A. Bryan @afinetheorem.bsky.social · Nov 27

Amazing paper (link next slide) by group incl. 2 Congolese researchers in Kinshasa looks at "official" corruption. Not just rogue officers but official policy to extort drivers = 80% of police revenue. Crazy shit: they worked w/ folks IN THE POLICE to secretly monitor bribes AND VARY bribe quotas!

November 27, 2024 at 7:04 PM

Matt Beane

@mattbeane.bsky.social

Not every day your work gets a healthy mention in the Sunday @nytimes.com!

The software talent market went into freefall in July of 2022.
Sarah Kessler takes us inside the maelstrom by investigating the impact on graduates of coding bootcamps. Great read.

www.nytimes.com/2024/11/24/b...

Do Coding Boot Camps Make Sense in an A.I. World?

Coding boot camps once looked like the golden ticket to an economically secure future. But as that promise fades, what should you do? Keep learning, until further notice.

www.nytimes.com

November 24, 2024 at 2:13 PM

Reposted by Matt Beane

Ethan Mollick

@emollick.bsky.social

Easy to get the wrong impression around here, but when you actually survey students, teachers, and parents they love AI.

In the survey, it is people who never used it who don’t like it. www.waltonfamilyfoundation.org/learning/the...

November 23, 2024 at 12:32 AM

Matt Beane

@mattbeane.bsky.social

Head to head twitter/bluesky social science test:

What are your go-to, recent empirical papers on surveillance and technology?

November 19, 2024 at 6:55 PM

Reposted by Matt Beane

Nick Brumfield

@nickjbrumfield.bsky.social

Culture is definitely gonna play a part, but architecture is going to be key to creating a positive social media environment

Bluesky has given us so many tools like this to cut out all the crap that made even pre-Elon Twitter so toxic

Tom Ashbyトム ∙ アシュビー @tomaashby.bsky.social · Nov 12

Fun fact: If someone quotes a post of yours in a way that is unwelcome (as is commonplace on Twitter/X) there is a tool to combat the unwanted attention. Simply click the three dot menu on the quote post & click "Detach quote". This removes your post from their quote post. Useful to know, do share!

November 19, 2024 at 2:59 AM

Matt Beane

@mattbeane.bsky.social

When people ask me which model to use for writing, I say Claude. Have for months.

This is now my go-to example for explaining why.

To get why Claude is better, first read and get why Claude is better. To break this recursion, just compare their explanations of recursion - you'll see.

Ethan Mollick @emollick.bsky.social · Nov 18

“Claude and ChatGPT, explain recursion in a clever way that is recursive.”

November 18, 2024 at 3:04 PM

Reposted by Matt Beane

Mikell Taylor

@mikell.bsky.social

I made a list of some of my favorite robotics people to follow, if any of former RoboticsTwitter is still looking for folks over here bsky.app/profile/did:...

November 17, 2024 at 1:51 PM

Matt Beane

@mattbeane.bsky.social

What a lovely gift on a Friday: using LLMs (admittedly, by playing to stereotypes) to confound scammers through a convincing "granny" avatar that will chat their time away. Vid worth watching for a laugh.

Not all disinformation harms the consumer!

news.virginmediao2.co.uk/o2-unveils-d...