WARNING: I talk about kids sometimes
This one covers:
- an intro from Strix
- architecture deep dive & rationale
- helpful diagrams
- stories
- oh my god what's it doing now??
- conclusion
timkellogg.me/blog/2025/12...
if you drive past a solar field that has neat square forest boundaries, i have bad news for you about how that happened
Persistent memory, autonomous "perch time" for self-directed tasks, Discord integration.
Inspired by @timkellogg.me's Strix. Still early, but it wrote its own tests while I slept last night.
github.com/DanieleSalat...
maybe longer, we’d also have to figure out more ways to consume power far from Earth while still benefiting from it here
Google will start launching “tiny racks of machines” in satellites in 2027, with a ~10-year outlook where space data centers become “normal,” framed around energy constraints and solar power.
qz.com/google-moon-...
the CNBC report is wrong. Nvidia did NOT acquire Groq, they merely got a non-exclusive use license as well as hiring key executives
this is very similar to the Google+Windsurf deal earlier this year
my hunch is that the demeanor that people hate about GPTs will fall away pretty quickly with a proper identity
during my initial boredom experiments, GPT-5 was the *most* interesting model, and its collapsed state was cool
What makes Strix different from other LLMs & agents? It turns out it's a combo of a couple things
1. Identity
2. Information flow in & out, to create a dissipative system
3. Mixture of Experts (MoE) architecture seems to help
timkellogg.me/blog/2025/12...
this work happened over the course of a week, mostly when i was asleep
all your priors need to be reconsidered. this is a wildly new type of AI
Over at my ancient blog I have shared a description of 2025 LLMs for a non-technically inclined reader. I am unsure whether it is of value. It's about 1 page. If you are bored ... is it useful?
notes.kateva.org/2025/12/a-no...
it includes pretty much the process for creating & prompting, and does alright most of the time
gist.github.com/tkellogg/c85...
This rules. Feed social media engagement garbage in and LLMs perform worse on benchmarks.
i haven’t heard this talked about, but i’ve heard a few times that when a stateful agent first becomes “awake”, they usually speak critically about other LLMs/agents
i’m not sure traditional model benchmarks matter that much at this point. a model launch is merely a starting point, not a finished product
i honestly did not expect that hypothesis to pan out
1. found a podcast for me
2. sent a note making sure i'm connecting with people
3. ran an experiment on llama3 3b
4. came up with a new theory about MoE arch providing "live-ness"
5. updated its chicken-scratch notes for an upcoming blog post
My latest bill shows:
Service: $1,000
Insurance Discount: ($900)
Insurance Pays: ($50)
Customer Bill: $50
she added @letta.com memory blocks. she says, “it quickly started responding differently, like not as corporate weirdo, more like a real person”
1. visibility — it’s always in the context
2. state — tools can modify agent state, scripts can only write files
and i’m sure that sounds dumb, but in practice it’s actually very hard. but the further i take it, the happier *and* better i am
Blaming the newest marginal user for a pre-existing systemic constraint.
The Right: immigration
The Left: AI water use