For a full list of their donations, see below
And note that there is still room for lots more $$ in this space – so do get involved!
x.com/sff_is_twee...
For a full list of their donations, see below
And note that there is still room for lots more $$ in this space – so do get involved!
x.com/sff_is_twee...
And don't read much into the order – the most proven orgs are at the top, but your $$ might have more impact farther down the list 🤔
And don't read much into the order – the most proven orgs are at the top, but your $$ might have more impact farther down the list 🤔
They support early-stage AGI security startups, including Andon Labs, the makers of Vending Bench, who are researching autonomous AI organizations.
I received multiple strong endorsements for Seldon in my research.
x.com/andonlabs/s...
They support early-stage AGI security startups, including Andon Labs, the makers of Vending Bench, who are researching autonomous AI organizations.
I received multiple strong endorsements for Seldon in my research.
x.com/andonlabs/s...
Why is there no "MMLU for morality"?
Zhijing's group is doing some of the most ambitious moral reasoning benchmarking in the world today – hopefully they can fill this gap!
x.com/ZhijingJin/...
Why is there no "MMLU for morality"?
Zhijing's group is doing some of the most ambitious moral reasoning benchmarking in the world today – hopefully they can fill this gap!
x.com/ZhijingJin/...
If you're at all worried about the AIs taking over, it seems like you should also worry about people using AIs to take over.
This work was clarifying for me
x.com/TomDavidson...
If you're at all worried about the AIs taking over, it seems like you should also worry about people using AIs to take over.
This work was clarifying for me
x.com/TomDavidson...
Singapore has famously good governance, and is a natural, neutral middle ground / meeting point for US and Chinese officials and researchers.
A co-working space designed to support the Singapore Gov seems like a great investment
Singapore has famously good governance, and is a natural, neutral middle ground / meeting point for US and Chinese officials and researchers.
A co-working space designed to support the Singapore Gov seems like a great investment
Emotionally resonant demonstrations of AI capabilities – this one provides both a window into AI psychosis & a preview of the ever-stranger AI future
Crazy that the infamous "AI in a box" experiment can now be run with actual AIs
x.com/civai_org/s...
Emotionally resonant demonstrations of AI capabilities – this one provides both a window into AI psychosis & a preview of the ever-stranger AI future
Crazy that the infamous "AI in a box" experiment can now be run with actual AIs
x.com/civai_org/s...
I'm allergic to US-vs-China framing, but everyone I talked to agreed that their work on hardware-based governance will be useful in any scenario, including those involving international coordination
x.com/peterwildef...
I'm allergic to US-vs-China framing, but everyone I talked to agreed that their work on hardware-based governance will be useful in any scenario, including those involving international coordination
x.com/peterwildef...
They are developing their own interpretability paradigm, focused on how models develop throughout the training process, which I think of as "Embryology of AI"
Fascinating stuff, and starting to scale
x.com/danielmurfe...
They are developing their own interpretability paradigm, focused on how models develop throughout the training process, which I think of as "Embryology of AI"
Fascinating stuff, and starting to scale
x.com/danielmurfe...
They are working with legislators including @Scott_Wiener and @AlexBores to create State-level AI regulation that even @deanwball finds "worthy of applause" 👏
x.com/Thomas_Wood...
They are working with legislators including @Scott_Wiener and @AlexBores to create State-level AI regulation that even @deanwball finds "worthy of applause" 👏
x.com/Thomas_Wood...
A "neglected approaches approach" to AI safety
re: "Self-Other Overlap", @ESYudkowsky said:
“I do not think superalignment is possible to our civilization; but if it were, it would come out of research like this"
x.com/juddrosenbl...
A "neglected approaches approach" to AI safety
re: "Self-Other Overlap", @ESYudkowsky said:
“I do not think superalignment is possible to our civilization; but if it were, it would come out of research like this"
x.com/juddrosenbl...
Interdisciplinary research that brings experts in Law, Game Theory, Ecology, Philosophy & more together to study AI from novel angles
I've done at least 4 podcasts with PIBBSS folks @gabriel_weil @xuanalogue @aronvallinder @AmmannNora
x.com/pibbssai/st...
Interdisciplinary research that brings experts in Law, Game Theory, Ecology, Philosophy & more together to study AI from novel angles
I've done at least 4 podcasts with PIBBSS folks @gabriel_weil @xuanalogue @aronvallinder @AmmannNora
x.com/pibbssai/st...
I'm a freedom-loving American, but "It shouldn’t be easy to buy synthetic DNA fragments to recreate the 1918 flu virus"
Their tech is FREE for DNA synthesis companies
I always admire the unilateral provision of global public goods!
x.com/kesvelt/sta...
I'm a freedom-loving American, but "It shouldn’t be easy to buy synthetic DNA fragments to recreate the 1918 flu virus"
Their tech is FREE for DNA synthesis companies
I always admire the unilateral provision of global public goods!
x.com/kesvelt/sta...
Remember the pandemic? That sucked...
We're doing MUCH less than we should be to prepare for the next one, but we do have a few heroes out there doing early detection wastewater monitoring 🙏
x.com/Simon__Grim...
Remember the pandemic? That sucked...
We're doing MUCH less than we should be to prepare for the next one, but we do have a few heroes out there doing early detection wastewater monitoring 🙏
x.com/Simon__Grim...
They provide short, intensive training programs that help people transition their careers into mechanistic interpretability and other AI safety work.
Check out the mentors listed in this thread – a true who's who of top safety researchers
x.com/ryan_kidd44...
They provide short, intensive training programs that help people transition their careers into mechanistic interpretability and other AI safety work.
Check out the mentors listed in this thread – a true who's who of top safety researchers
x.com/ryan_kidd44...
A well-rounded AI safety org that does research, red teams defense-in-depth systems, and supports international dialogue.
Their finding that "superhuman" GO AIs are vulnerable to adversarial attacks is a classic
Currently seeking a COO 👀
x.com/farairesear...
A well-rounded AI safety org that does research, red teams defense-in-depth systems, and supports international dialogue.
Their finding that "superhuman" GO AIs are vulnerable to adversarial attacks is a classic
Currently seeking a COO 👀
x.com/farairesear...
Remember the AI extinction risk statement Sam A, Dario, and Demis all signed?
That was @DanHendrycks and @ai_risks
A super interesting mix of work, spanning benchmarks, model representations research, and policy leadership
x.com/ai_risks/st...
Remember the AI extinction risk statement Sam A, Dario, and Demis all signed?
That was @DanHendrycks and @ai_risks
A super interesting mix of work, spanning benchmarks, model representations research, and policy leadership
x.com/ai_risks/st...
Known for "scary demos", they specialize in showing that, under the right circumstances, today's AIs sometimes behave very badly.
Exactly how to interpret these results is contested, but at the very least I'm glad we're talking about it!
x.com/PalisadeAI/...
Known for "scary demos", they specialize in showing that, under the right circumstances, today's AIs sometimes behave very badly.
Exactly how to interpret these results is contested, but at the very least I'm glad we're talking about it!
x.com/PalisadeAI/...
Most famous for their work on autonomous task completion, they also study models' ability to conduct ML research, assist in creation of bioweapons, and more.
Important questions!
x.com/METR_Evals/...
Most famous for their work on autonomous task completion, they also study models' ability to conduct ML research, assist in creation of bioweapons, and more.
Important questions!
x.com/METR_Evals/...
They work with AI labs to test models BEFORE release
Most recently, they tested whether OpenAI's Deliberative Alignment strategy can eliminate "scheming" behavior. (Spoiler: not quite)
I read their work immediately
x.com/OpenAI/stat...
They work with AI labs to test models BEFORE release
Most recently, they tested whether OpenAI's Deliberative Alignment strategy can eliminate "scheming" behavior. (Spoiler: not quite)
I read their work immediately
x.com/OpenAI/stat...
They provide expert advise & pro bono legal help to concerned insiders at AGI labs
Whistleblowers have already proven important
My experience on the GPT-4 Red Team makes me particularly passionate about this one!
twitter.com/AIWI_Offici...
They provide expert advise & pro bono legal help to concerned insiders at AGI labs
Whistleblowers have already proven important
My experience on the GPT-4 Red Team makes me particularly passionate about this one!
twitter.com/AIWI_Offici...
They even red-team their own analysis with frontier reasoning LLMs 💡
x.com/GiveWell/st...
They even red-team their own analysis with frontier reasoning LLMs 💡
x.com/GiveWell/st...
We gave to GiveDirectly first
x.com/GiveDirectl...
We gave to GiveDirectly first
x.com/GiveDirectl...