Lightnews — Scholar-powered news

Rishub Jain

@shubadubadub.bsky.social

560 followers 100 following 11 posts

Works at Google DeepMind on Safe+Ethical AI

Reposted by Rishub Jain

David Lindner

@davidlindner.bsky.social

New Google DeepMind safety paper! LLM agents are coming – how do we stop them finding complex plans to hack the reward?

Our method, MONA, prevents many such hacks, *even if* humans are unable to detect them!

Inspired by myopic optimization but better performance – details in🧵

January 23, 2025 at 3:33 PM

Rishub Jain

@shubadubadub.bsky.social

How do we ensure humans can still effectively oversee increasingly powerful AI systems? In our blog, we argue that achieving Human-AI complementarity is an underexplored yet vital piece of this puzzle! And, it’s hard, but we achieved it.

🧵(1/10)

December 24, 2024 at 12:01 AM

Reposted by Rishub Jain

brunost.bsky.social

@brunost.bsky.social

Can someone let me into Croatia’s inside joke

May 13, 2023 at 9:14 PM
