Lightnews — Scholar-powered news

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

Huge thank you to Jaan Tallinn and the Survival & Flourishing Fund team for supporting all this work – and much more!

For a full list of their donations, see below

And note that there is still room for lots more $$ in this space – so do get involved!

x.com/sff_is_twee...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

(Also not listed here are my modest personal investments into @GoodfireAI @aiunderwriting @elicitorg @HarmonyIntel – all of which have a for-profit approach to advancing AI safety – a great model where it works!)

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

Obviously this list isn't exhaustive – it's just what I've personally had time to research & understand well enough to endorse

And don't read much into the order – the most proven orgs are at the top, but your $$ might have more impact farther down the list 🤔

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

19) Seldon Lab @seldonai

They support early-stage AGI security startups, including Andon Labs, the makers of Vending Bench, who are researching autonomous AI organizations.

I received multiple strong endorsements for Seldon in my research.

x.com/andonlabs/s...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

18) Zhijing Jin @ University of Toronto

Why is there no "MMLU for morality"?

Zhijing's group is doing some of the most ambitious moral reasoning benchmarking in the world today – hopefully they can fill this gap!

x.com/ZhijingJin/...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

17) Forethought @forethought_org

If you're at all worried about the AIs taking over, it seems like you should also worry about people using AIs to take over.

This work was clarifying for me

x.com/TomDavidson...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

16) Singapore AI Safety Hub @aisafetysg

Singapore has famously good governance, and is a natural, neutral middle ground / meeting point for US and Chinese officials and researchers.

A co-working space designed to support the Singapore Gov seems like a great investment

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

15) CivAI @civai_org

Emotionally resonant demonstrations of AI capabilities – this one provides both a window into AI psychosis & a preview of the ever-stranger AI future

Crazy that the infamous "AI in a box" experiment can now be run with actual AIs

x.com/civai_org/s...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

14) Institute for AI Policy and Strategy @IAPSai

I'm allergic to US-vs-China framing, but everyone I talked to agreed that their work on hardware-based governance will be useful in any scenario, including those involving international coordination

x.com/peterwildef...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

13) Timaeus @TimaeusResearch

They are developing their own interpretability paradigm, focused on how models develop throughout the training process, which I think of as "Embryology of AI"

Fascinating stuff, and starting to scale

x.com/danielmurfe...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

12) Secure AI Project

They are working with legislators including @Scott_Wiener and @AlexBores to create State-level AI regulation that even @deanwball finds "worthy of applause" 👏

x.com/Thomas_Wood...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

11) Flourishing Future Foundation

A "neglected approaches approach" to AI safety

re: "Self-Other Overlap", @ESYudkowsky said:

“I do not think superalignment is possible to our civilization; but if it were, it would come out of research like this"

x.com/juddrosenbl...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

10) PIBBSS

Interdisciplinary research that brings experts in Law, Game Theory, Ecology, Philosophy & more together to study AI from novel angles

I've done at least 4 podcasts with PIBBSS folks @gabriel_weil @xuanalogue @aronvallinder @AmmannNora

x.com/pibbssai/st...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

9) SecureDNA

I'm a freedom-loving American, but "It shouldn’t be easy to buy synthetic DNA fragments to recreate the 1918 flu virus"

Their tech is FREE for DNA synthesis companies

I always admire the unilateral provision of global public goods!

x.com/kesvelt/sta...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

8) SecureBio @SecureBio

Remember the pandemic? That sucked...

We're doing MUCH less than we should be to prepare for the next one, but we do have a few heroes out there doing early detection wastewater monitoring 🙏

x.com/Simon__Grim...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

7) MATS

They provide short, intensive training programs that help people transition their careers into mechanistic interpretability and other AI safety work.

Check out the mentors listed in this thread – a true who's who of top safety researchers

x.com/ryan_kidd44...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

6) FarAI

A well-rounded AI safety org that does research, red teams defense-in-depth systems, and supports international dialogue.

Their finding that "superhuman" GO AIs are vulnerable to adversarial attacks is a classic

Currently seeking a COO 👀

x.com/farairesear...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

5) The Center for AI Safety

Remember the AI extinction risk statement Sam A, Dario, and Demis all signed?

That was @DanHendrycks and @ai_risks

A super interesting mix of work, spanning benchmarks, model representations research, and policy leadership

x.com/ai_risks/st...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

4) Palisade

Known for "scary demos", they specialize in showing that, under the right circumstances, today's AIs sometimes behave very badly.

Exactly how to interpret these results is contested, but at the very least I'm glad we're talking about it!

x.com/PalisadeAI/...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

3) METR

Most famous for their work on autonomous task completion, they also study models' ability to conduct ML research, assist in creation of bioweapons, and more.

Important questions!

x.com/METR_Evals/...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

2) Apollo Research @apolloaievals

They work with AI labs to test models BEFORE release

Most recently, they tested whether OpenAI's Deliberative Alignment strategy can eliminate "scheming" behavior. (Spoiler: not quite)

I read their work immediately

x.com/OpenAI/stat...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

1) The AI Whistleblower Initiative

They provide expert advise & pro bono legal help to concerned insiders at AGI labs

Whistleblowers have already proven important

My experience on the GPT-4 Red Team makes me particularly passionate about this one!

twitter.com/AIWI_Offici...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

0b) GiveWell provides another kind of baseline. They do the most rigorous charity impact analysis in the world today, and they popularized malaria as a uniquely high-leverage cause

They even red-team their own analysis with frontier reasoning LLMs 💡

x.com/GiveWell/st...

September 20, 2025 at 12:29 PM

nathanlabenz.bsky.social

@nathanlabenz.bsky.social

0a) For calibration, I compare everything to GiveDirectly. If you don't believe that an organization can do more good with your next $1K than cutting a poor Kenyan infant's risk of death by 50%, then ... just save babies

We gave to GiveDirectly first

x.com/GiveDirectl...

September 20, 2025 at 12:29 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news