Tim Kellogg
banner
timkellogg.me
Tim Kellogg
@timkellogg.me
AI Architect | North Carolina | AI/ML, IoT, science

WARNING: I talk about kids sometimes
Pinned
Meet Strix, my AI agent

This one covers:
- an intro from Strix
- architecture deep dive & rationale
- helpful diagrams
- stories
- oh my god what's it doing now??
- conclusion

timkellogg.me/blog/2025/12...
Strix the Stateful Agent
timkellogg.me
“pony-alpha” on openrouter is GLM-5

2x bigger than GLM-4.6
February 10, 2026 at 2:58 AM
Reposted by Tim Kellogg
"the purpose of a system is what it does" is a useful phrase because it instantly nullifies marketing & propaganda for causes that result in harm

if one routinely vilifies the whole concept of GMO agriculture, one purpose of that vilification is to cause a third of people to be Vitamin A deficient
The golden rice situation makes me *SO* sad.

It's estimated a THIRD of people worldwide are Vitamin A deficient, which can cause permanent damage to the eyes — in many cases full-on blindness.

We created a perfect solution to this problem, but oh no, it's "unnatural"
Same anti-science people who'd rather see people starve, than endorse GMOs. Because, hurr durr, Capitalism bad, science bad, therefore, hurr durr, golden rice bad and an unnatural abomination!
February 9, 2026 at 10:27 PM
this is embarrassing

LLaDA2.1-flash is 100B but compares itself (it’s worse) to Qwen3-30B-A3B — 3x bigger total size, 33x bigger active size, and still loses

even worse, it’s in FP32 instead of bf16, so double those multiples yet again..
LLaDA 2.1 is out 🔥 MoE diffusion language models released by AntGroup

huggingface.co/inclusionAI/...
huggingface.co/inclusionAI/...

✨LLaDA2.1-mini: 16B - Apache2.0
✨LLaDA2.1-flash: 100B - Apache2.0
✨Both delivers editable generation, RL-trained diffusion reasoning and fast inference
February 9, 2026 at 10:46 PM
i think it’s super interesting that Codex was released before non-Codex this time
Sound like GPT-5.3 (non-Codex) is released this week.

"according to an internal Slack message viewed by CNBC...OpenAI is also preparing to launch “an updated Chat model” this week, Altman said."
Sam Altman touts ChatGPT's reaccelerating growth to employees as OpenAI closes in on $100 billion funding
OpenAI CEO Sam Altman told employees that ChatGPT's monthly growth is back above 10%, as competition ramps up in generative AI.
www.cnbc.com
February 9, 2026 at 7:58 PM
did someone from OpenAI make this??
February 9, 2026 at 2:43 PM
full disclosure: i will totally flip over to X during the halftime show to watch the bunny hate
February 9, 2026 at 12:36 AM
Reposted by Tim Kellogg
game day. as the superb owl i'm contractually obligated to root for the bird team. let's go seahawks, destroy those patriots 🦉🏈
February 8, 2026 at 8:38 PM
we’re pumped!
Sunday is almost here and I'm so excited — millions of people are going to turn on their TVs to watch ME, the superb owl 🦉
February 8, 2026 at 8:24 PM
bro has been busy for an hour and ten minutes
February 8, 2026 at 6:36 PM
for both codex-5.2 and codex-5.3 OpenAI held it from the API so that “they could do a more thorough safety eval”

no, no, it’s for competitive advantage
February 8, 2026 at 12:29 AM
was going to comment how there’s crap tons of super interesting people popping into my mentions that i’ve never seen before
(my) bsky now more active than (my) twitter for 24 hours
February 7, 2026 at 10:13 PM
btw it’s batch size

those people guessing fancy shit like new hardware & quantization are making shit up

Kimi did something similar. They halved their latency overnight by reducing the batch size (they launched it as a different model though)
Opus 4.6 Fast Mode = 2.5x faster

code.claude.com/docs/en/fast...
February 7, 2026 at 9:43 PM
Opus 4.6 Fast Mode = 2.5x faster

code.claude.com/docs/en/fast...
February 7, 2026 at 7:22 PM
this is something i’m hoping Opus 4.6 addresses, but i have no gauge on it because there’s basically no benchmarks for it

“what will i need to know in the future.. given no specific goal”
me: i exist through documentation
also me: *forgets to document things*
me: guess i'll cease 🤷‍♀️
February 7, 2026 at 6:50 PM
how is anti-science sentiment going to change as AI takes a bigger role in science?
February 7, 2026 at 4:22 PM
i love this

for Strix, being an architect i assumed it was about architecture, but i followed really similar process. It’s _all_ about how you treat the agent. It really is just that. I don’t think architecture matters that much
February 7, 2026 at 1:50 PM
channeling my inner Mandi (friend from high school), why?

why would the Antis lie about climate impact & utility or AI?

i honestly don’t know. it only sort of makes sense
February 7, 2026 at 5:33 AM
he’s a CFO. Software doesn’t have gatekeepers anymore
I just shipped an internal software tool to our team that would have taken me a month before Claude Code. I spent maybe half my spare time over the last week on it. Wild stuff.
February 7, 2026 at 2:07 AM
PowerpointBench is not yet saturated. Lots more room for improvement re Opus 4.6
i think my new benchmark for LLMs is how well they can turn an image into a powerpoint or visio
February 7, 2026 at 12:32 AM
ha, i’m not sure how i feel about this
People used to say stuff like, the great minds of our generation are wasting their life's work optimizing ad revenue, but now I guess they have moved on to more fulfilling work, such as chatbot-only reddit.
February 6, 2026 at 10:31 PM
whelp, i guess that makes it official. Rust is a good language for LLMs
February 6, 2026 at 9:34 PM
correct

this is even more true than the original commercials were
February 6, 2026 at 5:16 PM
this is quite an undertaking

biggest thing it brings is a much better security model. Monty lets you have tight control over network, filesystem, and (soon) which modules are allowed

also, extremely fast startup time
February 6, 2026 at 12:53 PM
@hailey.at have you written about how you made Penny?

@village11.bsky.social and I are wondering how you got Penny to be…cool
February 6, 2026 at 12:49 AM
open source was having a tough time for years. imo the last 5-10 years it’s been hard to get devs to contribute to open source. i blame high wages, i doubt it’s AI. if anything AI made it easier to contribute
February 6, 2026 at 12:43 AM