Jason
@jas-ho.bsky.social
Co-Director Apart Research | apartresearch.com
Aligning AIs is hard, and even knowing what to aim for is non-trivial. Very excited about the work by @jacyanthis.bsky.social and team on this important problem, and very proud that @apartresearch.bsky.social was able to support this project!
LLM agents are optimized for thumbs-up instant gratification. RLHF -> sycophancy.

We propose human agency as a new alignment target in HumanAgencyBench, made possible by AI simulation and evals. We find, e.g., that Claude most supports agency but also most tries to steer user values 👇 arxiv.org/abs/2509.08494
September 16, 2025 at 10:49 AM
Reposted by Jason
If you wanted to see how little attention folks are paying to the possibility of AGI (however defined), no matter how much the labs publicly discuss it: here is an official course from Google DeepMind whose first session is "we are on a path to superhuman capabilities".

It has fewer than 1,000 views.
April 3, 2025 at 3:05 PM