Lightnews — Scholar-powered news

Reposted by akbir khan

Epoch AI

@epochai.bsky.social

We’ve added four new benchmarks to the Epoch AI Benchmarking Hub: Aider Polyglot, WeirdML, Balrog, and Factorio Learning Environment!

Before we only featured our own evaluation results, but this new data comes from trusted external leaderboards. And we've got more on the way 🧵

May 8, 2025 at 3:00 PM

Reposted by akbir khan

Epoch AI

@epochai.bsky.social

4. Factorio Learning Environment by Jack Hopkins, Märt Bakler , and
@akbir.bsky.social

This benchmark uses the factory-building game Factorio to test complex, long-term planning, with settings for lab-play (structured tasks) and open-play (unbounded growth).
jackhopkins.github.io/factorio-lea...

Factorio Learning Environment

Claude Sonnet 3.5 builds factories

jackhopkins.github.io

May 8, 2025 at 3:00 PM

Reposted by akbir khan

Johannes Gasteiger🔸

@gasteigerjo.bsky.social

New Anthropic blog post: Subtle sabotage in automated researchers.

As AI systems increasingly assist with AI research, how do we ensure they're not subtly sabotaging that research? We show that malicious models can undermine ML research tasks in ways that are hard to detect.

March 25, 2025 at 4:03 PM

akbir khan

@akbir.bsky.social

control is a complimentary approach to alignment.

its really sensible, practical and can be done now, even before systems are superintelligent.

youtu.be/6Unxqr50Kqg?...

Controlling powerful AI

YouTube video by Anthropic

youtu.be

March 18, 2025 at 3:22 PM

Reposted by akbir khan

Ethan Mollick

@emollick.bsky.social

This is a crazy paper. Fine-tuning a big GPT-4o on a small amount of insecure code or even "bad numbers" (like 666) makes them misaligned in almost everything else. They are more likely to start offering misinformation, spouting anti-human values, and talk about admiring dictators. Why is unclear.

February 25, 2025 at 9:01 PM

akbir khan

@akbir.bsky.social

www.anthropic.com/news/paris-a...

Statement from Dario Amodei on the Paris AI Action Summit

A call for greater focus and urgency

www.anthropic.com

February 11, 2025 at 9:08 PM

akbir khan

@akbir.bsky.social

This is the entire goal

Grace @gracekind.net · Jan 31

It’s weird to live in a world where AI models are more aligned than the CEOs of the companies creating them

February 1, 2025 at 2:13 AM

akbir khan

@akbir.bsky.social

darioamodei.com/on-deepseek-...

Dario Amodei — On DeepSeek and Export Controls

On DeepSeek and Export Controls

darioamodei.com

January 30, 2025 at 2:09 AM

Reposted by akbir khan

Hank Green

@hankgreen.bsky.social

The fact that Deepseek R1 was released three days /before/ Stargate means these guys stood in front of Trump and said they needed half a trillion dollars while they knew R1 was open source and trained for $5M.

Beautiful.

Trump announces 500B in AI funding. Five days ago.

January 28, 2025 at 3:02 AM

Reposted by akbir khan

Zack Witten

@zswitten.bsky.social

Can anyone get a shorter DeepSeek R1 CoT than this?

January 24, 2025 at 6:11 AM

Reposted by akbir khan

Tom Everitt

@tom4everitt.bsky.social

Process based supervision done right, and with pretty CIDs to illustrate :)

January 23, 2025 at 8:33 PM

Reposted by akbir khan

Mark Riedl

@markriedl.bsky.social

I don’t really have the energy for politics right now. So I will observe without comment:

Executive Order 14110 was revoked (Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence)

January 21, 2025 at 12:34 AM

akbir khan

@akbir.bsky.social

R1 model is impressive

January 21, 2025 at 10:21 PM

Reposted by akbir khan

Sarah Jones

@sarahjones.bsky.social

January 16, 2025 at 6:33 PM

akbir khan

@akbir.bsky.social

fuck the tabloids were right

www.nytimes.com/2025/01/15/t...

She Is in Love With ChatGPT

A 28-year-old woman with a busy social life spends hours on end talking to her A.I. boyfriend for advice and consolation. And yes, they do have sex.

www.nytimes.com

January 16, 2025 at 6:46 AM

Reposted by akbir khan

Ethan Mollick

@emollick.bsky.social

New randomized, controlled trial by the World Bank of students using GPT-4 as a tutor in Nigeria. Six weeks of after-school AI tutoring = 2 years of typical learning gains, outperforming 80% of other educational interventions.

And it helped all students, especially girls who were initially behind.

January 15, 2025 at 8:58 PM

Reposted by akbir khan

Ethan Mollick

@emollick.bsky.social

Generative AI has flaws and biases, and there is a tendency for academics to fix on that (85% of equity LLM papers focus on harms)…

…yet in many ways LLMs are uniquely powerful among new technologies for helping people equitably in education and healthcare. We need an urgent focus on how to do that

January 14, 2025 at 5:45 PM

Reposted by akbir khan

Ethan Mollick

@emollick.bsky.social

On one hand, this paper finds adding inference-time compute (like o1 does) improves medical reasoning, which is an important finding suggesting a way to continue to improve AI performance in medicine

On the other hand, scientific illustrations are apparently just anime now arxiv.org/pdf/2501.06458

January 14, 2025 at 5:56 AM

akbir khan

@akbir.bsky.social

my metabolism is noticeably higher in london than the bay.

January 13, 2025 at 3:49 PM

akbir khan

@akbir.bsky.social

What can AI researchers do *today* that AI developers will find useful for ensuring the safety of future advanced AI systems? To ring in the new year, the Anthropic Alignment Science team is sharing some thoughts on research directions we think are important.
alignment.anthropic.com/2025/recomme...

Recommendations for Technical AI Safety Research Directions

alignment.anthropic.com

January 10, 2025 at 9:03 PM

Reposted by akbir khan

Hank Green

@hankgreen.bsky.social

My hottest take is that nothing makes any sense at all outside of the context of the constantly increasing value of human life, but that increase in value is so invisible (and exists in a world that was built for previous, lower values) that we constantly think the opposite has happened.

January 5, 2025 at 7:08 PM

Reposted by akbir khan

Zack Witten

@zswitten.bsky.social

darioamodei.com/machines-of-...

Dario Amodei — Machines of Loving Grace

How AI Could Transform the World for the Better

darioamodei.com

January 4, 2025 at 5:38 PM

akbir khan

@akbir.bsky.social

Nothing kills my excitement of returning to the US like the response i get from CBP officers.

January 4, 2025 at 4:13 AM

Reposted by akbir khan

Andrew Lampinen

@lampinen.bsky.social

Felix Hill was such an incredible mentor — and occasional cold water swimming partner — to me. He's a huge part of why I joined DeepMind and how I've come to approach research. Even a month later, it's still hard to believe he's gone.

Felix Hill and some other DMers and I after cold water swimming at Parliament Hill Lido a few years ago

January 2, 2025 at 7:01 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news