ag
@ninerealmlabs.com
Build agents in 2025, RL-tune them in 2026. GPT‑5 economics (~15x cheaper output, 2–4x faster than 4.5; per Epoch AI) signal a shift from naive scale to post‑training RL.

What does this mean for reliable, long‑horizon agents? My take: aimlbling-about.ninerealmlabs.com/blog/reinfor...
Reinforcement Learning
I had intended to start this post by proclaiming “2026 will be the year of reinforcement learning” as 2025 is “the year of agents”… But model and research releases over the past several weeks indicate...
aimlbling-about.ninerealmlabs.com
August 11, 2025 at 10:27 PM
What do AI sycophancy, the Chatbot Arena, and the Pepsi Challenge have in common?

aimlbling-about.ninerealmlabs.com/blog/sycopha...

Consider the Pepsi Challenge as a parable for short-term preference (over)optimization that contributes to less reliable and trustworthy experiences.
Sycophancy, Planning, and the Pepsi Challenge
Sycophancy On April 25th, we [OpenAI] rolled out an update to GPT‑4o in ChatGPT that made the model noticeably more sycophantic. It aimed to please the user, not just as flattery, but also as v...
aimlbling-about.ninerealmlabs.com
July 11, 2025 at 12:37 PM
Keeping up with AI feels like a treadmill on max speed.
I’ve been sharing a biweekly newsletter to help my team stay sane, and now I’m publishing it more broadly.
Artisanal, curated, human-written insights (no AI fluff).
Catch your breath:

aimlbling-about.ninerealmlabs.com/treadmill
AI Treadmill
Keeping up with AI feels like sprinting uphill on a maxed-out treadmill
aimlbling-about.ninerealmlabs.com
May 30, 2025 at 10:13 PM
Superintelligence Strategy proposes Mutual Assured AI Malfunction (MAIM) as a model for AI national security—borrowing from Mutual Assured Destruction (MAD). The problem:

AI is an arms race, not a deterrent, even though the potential for risk is real: aimlbling-about.ninerealmlabs.com/blog/maim-is...
MAIM Is MADness
Superintelligence Strategy is a policy paper by Dan Hendrycks (Director, Center for AI Safety), Eric Schmidt (former CEO, Google), and Alexandr Wang (CEO, Scale AI), that proposes a three-part framewo...
aimlbling-about.ninerealmlabs.com
March 10, 2025 at 12:57 PM
The Bitter Lesson, somewhat confusingly, is a parable with an optimistic view of how machine learning and AI systems improve with scale, yet might leave us all with a bad taste in our mouths.

aimlbling-about.ninerealmlabs.com/blog/the-bit...
February 17, 2025 at 3:29 PM
In 2024, I shared ~2000 AI resources (40% of which were arXiv papers!) in an internal newsletter I curate. 
No wonder staying up-to-date feels like running on a treadmill cranked all the way up!

Details in my blog: aimlbling-about.ninerealmlabs.com/blog/the-ai-...
The AI Treadmill (2024)
At the end of 2023, I was transferred to the team working on PMI Infinity, an AI-powered tool and product for Project Managers, to act as an AI/ML Engineer and provide expertise on the AI/ML aspects o...
aimlbling-about.ninerealmlabs.com
January 1, 2025 at 2:00 PM