All views my own. He/him.
www.youtube.com/watch?v=lFAm...
In this episode, we cover the varying claims on the likelihood of 'The Singularity' hypothesis, and the risks of unfounded forecasting.
Full episode up now: kairos.fm/intoaisafety...
In this episode, we cover the varying claims on the likelihood of 'The Singularity' hypothesis, and the risks of unfounded forecasting.
Full episode up now: kairos.fm/intoaisafety...
@jacobhaimes.bsky.social & @thegermanpole.bsky.social break down how instruction tuning/RLHF create anthropomorphic chatbots that exploit human empathy—leading to mental health harms. kairos.fm/muckraikers/...
Find us wherever you listen! (links in thread)
@jacobhaimes.bsky.social & @thegermanpole.bsky.social break down how instruction tuning/RLHF create anthropomorphic chatbots that exploit human empathy—leading to mental health harms. kairos.fm/muckraikers/...
Find us wherever you listen! (links in thread)
Final scores were 4 5 4 4, i.e. all reviewers and the AC agreed the paper was an Accept.
Absolutely wild.
Final scores were 4 5 4 4, i.e. all reviewers and the AC agreed the paper was an Accept.
Absolutely wild.
Li-Lian Ang from BlueDot Impact discusses their evolution from broad AI safety courses to targeted impact acceleration, addressing elitism in the field, and why we need more advocates beyond just technical researchers.
kairos.fm/intoaisafety/e023
Li-Lian Ang from BlueDot Impact discusses their evolution from broad AI safety courses to targeted impact acceleration, addressing elitism in the field, and why we need more advocates beyond just technical researchers.
kairos.fm/intoaisafety/e023
Working with @jacyanthis.bsky.social (and the team) has been fantastic, and I'd happily do it again. If you have the chance to work with him, don't pass it up!
We propose human agency as a new alignment target in HumanAgencyBench, made possible by AI simulation/evals. We find e.g., Claude most supports agency but also most tries to steer user values 👇 arxiv.org/abs/2509.08494
Working with @jacyanthis.bsky.social (and the team) has been fantastic, and I'd happily do it again. If you have the chance to work with him, don't pass it up!
I chatted with Andres Sepulveda Morales, founder of Red Mage Creative and organizer of the Fort Collins Rocky Mountain AI Interest Group about surviving the tech layoff cycle, dark patterns in AI, and building inclusive AI communities.
I chatted with Andres Sepulveda Morales, founder of Red Mage Creative and organizer of the Fort Collins Rocky Mountain AI Interest Group about surviving the tech layoff cycle, dark patterns in AI, and building inclusive AI communities.
Will Petillo from PauseAI joins to discuss the grassroots movement for pausing frontier AI development, balancing diverse perspectives in activism, and why meaningful AI governance requires both political engagement and public support kairos.fm/intoaisafety...
Will Petillo from PauseAI joins to discuss the grassroots movement for pausing frontier AI development, balancing diverse perspectives in activism, and why meaningful AI governance requires both political engagement and public support kairos.fm/intoaisafety...
Find us on Spotify, Apple Podcasts, YouTube, or wherever you listen (links in thread).
Find us on Spotify, Apple Podcasts, YouTube, or wherever you listen (links in thread).
You can find the show on Spotify, Apple Podcasts, YouTube, or wherever else you listen (links in thread).
You can find the show on Spotify, Apple Podcasts, YouTube, or wherever else you listen (links in thread).
Listen on Spotify, Apple Podcasts, YouTube, or wherever you get your podcasts!
Listen on Spotify, Apple Podcasts, YouTube, or wherever you get your podcasts!
Listen on Spotify, Apple Podcasts, YouTube, or wherever you get your podcasts!
Listen on Spotify, Apple Podcasts, YouTube, or wherever you get your podcasts!
"The US has also demanded that the final statement excludes any mention of the environmental cost of AI, existential risk or the UN." - www.thetimes.com/article/a7ae...
☹️
"The US has also demanded that the final statement excludes any mention of the environmental cost of AI, existential risk or the UN." - www.thetimes.com/article/a7ae...
☹️
Developments are ongoing, but if you want a good 15 minute overview of new so far, check out kairos.fm/muckraikers/... or find us wherever you listen!
Developments are ongoing, but if you want a good 15 minute overview of new so far, check out kairos.fm/muckraikers/... or find us wherever you listen!
Check it out for one way to actively monitor one kind of AI misuse: LLM-based cyberattacks.
www.apartresearch.com/post/hunting...
Check it out for one way to actively monitor one kind of AI misuse: LLM-based cyberattacks.
www.apartresearch.com/post/hunting...
You can find the show on Spotify, Apple Podcasts, YouTube, or wherever else you listen.
You can find the show on Spotify, Apple Podcasts, YouTube, or wherever else you listen.
www.youtube.com/watch?v=lFAm...
www.youtube.com/watch?v=lFAm...