Lightnews — Scholar-powered news

ag

@ninerealmlabs.com

Build Agents in 2025, RL tune them in 2026. GPT‑5 economics (~15x cheaper output, 2–4x faster than 4.5; per Epoch AI) signal a shift from naive scale to post‑training RL.

What does this mean for reliable, long‑horizon agents? My take: aimlbling-about.ninerealmlabs.com/blog/reinfor...

Reinforcement Learning

I had intended to start this post by proclaiming “2026 will be the year of reinforcement learning” as 2025 is “the year of agents”… But model and research releases over the past several weeks indicate...

aimlbling-about.ninerealmlabs.com

August 11, 2025 at 10:27 PM

ag

@ninerealmlabs.com

What do AI sycophancy, the Chatbot Arena, and the Pepsi Challenge have in common?

aimlbling-about.ninerealmlabs.com/blog/sycopha...

Consider the Pepsi Challenge as a parable for short-term preference (over)optimization that contributes to less reliable and trustworthy experiences.

Sycophancy, Planning, and the Pepsi Challenge

Sycophancy On April 25th, we [OpenAI] rolled out an update to GPT‑4o in ChatGPT that made the model noticeably more sycophantic. It aimed to please the user, not just as flattery, but also as v...

aimlbling-about.ninerealmlabs.com

July 11, 2025 at 12:37 PM

ag

@ninerealmlabs.com

Keeping up with AI feels like a treadmill on max speed.
I’ve been sharing a biweekly newsletter to help my team stay sane, and now I’m publishing it more broadly.
Artisanal, curated, human-written insights (no AI fluff).
Catch your breath:

aimlbling-about.ninerealmlabs.com/treadmill

AI Treadmill

Keeping up with AI feels like sprinting uphill on a maxed-out treadmill

aimlbling-about.ninerealmlabs.com

May 30, 2025 at 10:13 PM

ag

@ninerealmlabs.com

Superintelligence Strategy proposes Mutual Assured AI Malfunction (MAIM) as a model for AI national security—borrowing from Mutual Assured Destruction (MAD). The problem:

AI is an arms race, not a deterrent, even though the risks are real: aimlbling-about.ninerealmlabs.com/blog/maim-is...

MAIM Is MADness

Superintelligence Strategy is a policy paper by Dan Hendrycks (Director, Center for AI Safety), Eric Schmidt (former CEO, Google), and Alexandr Wang (CEO, Scale AI), that proposes a three-part framewo...

aimlbling-about.ninerealmlabs.com

March 10, 2025 at 12:57 PM

ag

@ninerealmlabs.com

The Bitter Lesson, somewhat confusingly, is a parable with an optimistic view of how machine learning and AI systems improve with scale, yet might leave us all with a bad taste in our mouths.

aimlbling-about.ninerealmlabs.com/blog/the-bit...

A 3D graph titled 'AI Model Capability Distributions' visualizes the performance, specificity, and timeline of various AI models from 2022 to 2025. The graph features curved distributions for models including GPT-3.5, BloombergGPT, GPT-4, o1-IOI, and o3. Performance is plotted on the vertical axis, specificity on the depth axis, and the timeline on the horizontal axis. GPT-3.5 and BloombergGPT appear in the lower performance range, while GPT-4, o1-IOI, and o3 exhibit higher performance and specificity, with o3 positioned as the most advanced model projected for 2025. Different colors distinguish the models.

February 17, 2025 at 3:29 PM

ag

@ninerealmlabs.com

In 2024, I shared ~2000 AI resources (40% of which were arXiv papers!) in an internal newsletter I curate. No wonder staying up-to-date feels like running on a treadmill cranked all the way up!

Details in my blog: aimlbling-about.ninerealmlabs.com/blog/the-ai-...

The AI Treadmill (2024)

At the end of 2023, I was transferred to the team working on PMI Infinity, an AI-powered tool and product for Project Managers, to act as an AI/ML Engineer and provide expertise on the AI/ML aspects o...

aimlbling-about.ninerealmlabs.com

January 1, 2025 at 2:00 PM
