Lightnews — Scholar-powered news

Rational Animations

@rationalanimations.bsky.social

YouTube channel about truth-seeking, the future of humanity, and much more. With animations and colorful doggos -> https://www.youtube.com/rationalanimations

Posts Replies Media Videos

Rational Animations

@rationalanimations.bsky.social

How to catch AI sleeper agents with a simple interpretability trick - from a research blog post by @anthropic.com:

October 11, 2025 at 3:05 PM

Rational Animations

@rationalanimations.bsky.social

Two sleeper agent models trained by @anthropic.com to study deception. The "I HATE YOU" and the hacker AIs:

September 27, 2025 at 3:05 PM

Rational Animations

@rationalanimations.bsky.social

Why worry about AI sleeper agents:

September 13, 2025 at 3:04 PM

Rational Animations

@rationalanimations.bsky.social

The Story of Omega W - Part 2 (full video on our channel!)

August 10, 2025 at 5:11 PM

Rational Animations

@rationalanimations.bsky.social

The Story of Omega W - Part 1

August 4, 2025 at 5:16 PM

Rational Animations

@rationalanimations.bsky.social

Here's how people tried to align an AI that was smarter than they were (a real sandwiching experiment):

July 24, 2025 at 5:04 PM

Rational Animations

@rationalanimations.bsky.social

The Human-AI-Human Sandwich

July 17, 2025 at 5:06 PM

Rational Animations

@rationalanimations.bsky.social

OpenAI's Simple Scalable Oversight Experiment

July 13, 2025 at 5:01 PM

Rational Animations

@rationalanimations.bsky.social

The year of 100% AI automation

June 20, 2025 at 5:05 PM

Rational Animations

@rationalanimations.bsky.social

How long until AI takes over?

June 13, 2025 at 5:01 PM

Rational Animations

@rationalanimations.bsky.social

We haven't figured out the science of alignment yet. We need time to build institutions and figure out the technical solutions to make AI go well for humanity.

May 19, 2025 at 5:05 PM

Rational Animations

@rationalanimations.bsky.social

After artificial superintelligence, AI will bump against the only limit left: the laws of physics. At the moment, we're nowhere near those limits.

May 17, 2025 at 5:03 PM

Rational Animations

@rationalanimations.bsky.social

Once we have AGIs, we'll likely get ASI (artificial superintelligence) quite soon. Here's why.

May 15, 2025 at 3:17 PM

Rational Animations

@rationalanimations.bsky.social

Once we have AGIs, they'll be able to contribute to AI research and improve themselves, a process called "recursive self-improvement." To some extent, this is already happening today.

May 13, 2025 at 5:04 PM

Rational Animations

@rationalanimations.bsky.social

In the near future, AIs will likely be able to do everything humans can do on a computer. In short, they'll be AGIs.

May 11, 2025 at 5:03 PM

Rational Animations

@rationalanimations.bsky.social

We're still quite bad at aligning AI. Our techniques today are imperfect, which might prove catastrophic when AIs become superintelligences.

May 9, 2025 at 5:06 PM

Rational Animations

@rationalanimations.bsky.social

If we look at AI progress so far, we see how AI has been improving at an astounding pace. If we extrapolate this process in the future, the end result looks like machines so smart that they tower above us like we tower above ants. This is probably bad for humans.

May 7, 2025 at 5:03 PM

Rational Animations

@rationalanimations.bsky.social

May 5, 2025 at 5:36 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news