Tim Kellogg
@timkellogg.me
AI Architect | North Carolina | AI/ML, IoT, science
WARNING: I talk about kids sometimes
Pinned
Tim Kellogg
@timkellogg.me
· Sep 29
Does AI Get Bored?
timkellogg.me
Does AI get bored?
I gave them nothing to do, just to see what happens
one thing — they devolve into a repetitive “collapse” state, I guess you could call it boredom
but some break out into math & poetry on their own, I didn’t expect which ones that would be
timkellogg.me/blog/2025/09...
let’s have a moment of silence for the incidents we lost control of
Thank you for your software-as-a-service. 🫡
November 11, 2025 at 5:05 PM
me? oh yes i’m a veteran, of bad data architecture
November 11, 2025 at 4:22 PM
tl;dr: it was too long, i didn’t read it but here’s what i think it should say
November 11, 2025 at 4:06 PM
interesting paper
imo if base models perform the same at high pass@k, then RLVR is just making them better *agents*, bc the reduced error rate translates to long agent trajectories
so while there are limits to RLVR, it’s clearly necessary
limit-of-rlvr.github.io
November 11, 2025 at 2:12 PM
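A rough sketch of the arithmetic behind that take, assuming independent samples and independent steps. The function names and every number here are hypothetical, chosen only for illustration, and none of them come from the linked paper.

```python
def pass_at_k(p: float, k: int) -> float:
    """Chance that at least one of k independent samples is correct."""
    return 1.0 - (1.0 - p) ** k


def trajectory_success(step_error: float, steps: int) -> float:
    """Chance that every step of an agent trajectory succeeds,
    assuming each step fails independently with the same probability."""
    return (1.0 - step_error) ** steps


# Hypothetical single-sample accuracies for a base and an RLVR-tuned model.
base_p, tuned_p = 0.30, 0.60

# With many samples, both models end up close to 1.0.
for k in (1, 256):
    print(k, round(pass_at_k(base_p, k), 3), round(pass_at_k(tuned_p, k), 3))

# Over a 50-step trajectory, a small change in per-step error compounds.
for err in (0.05, 0.01):
    print(err, round(trajectory_success(err, 50), 3))
```

Under these assumptions the gap between the two models nearly vanishes at high k, while the per-step error rate still decides whether a long trajectory finishes at all.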
oh! teor likes it! congrats “Alexander” @dorialexander.bsky.social 🤣
November 11, 2025 at 12:17 PM
chat, what are we thinking: quantization or batch size?
November 11, 2025 at 12:15 PM
80 layers — for those not paying attention, @dorialexander.bsky.social has been posting for weeks about how small models with deep rather than wide layers exhibit eerie emergent behavior
this one is worth checking out
Synthetic playgrounds enabled a series of controlled experiments that brought us to favor extreme depth design. We selected an 80-layer architecture for Baguettotron, with improvements across the board on memorization of logical reasoning: huggingface.co/PleIAs/Bague...
November 11, 2025 at 2:26 AM
Obama: people should sell their old cars
Trump: people should sell their souls
The US Trump Administration is reportedly working on 15 year car loans.
US President Donald Trump also posted yesterday regarding allowing 50 year mortgages.
November 10, 2025 at 10:37 PM
readwren: get AI to mimic your writing style
it gives you a short interview and hands back a reusable prompt that you can paste anywhere
uses K2-Thinking & a few other systems
github.com/muratcankoyl...
GitHub - muratcankoylan/readwren: An adaptive multi-agent system that extracts your literary DNA through conversation and generates actionable reading profiles.
github.com
November 10, 2025 at 8:00 PM
Reposted by Tim Kellogg
Listen, the technology & alternative approach to LLM training is interesting & all, but can we focus on the most important detail, which is that the larger model is named "Baguettotron?"
It’s named Baguettotron, people.
BAGUETTOTRON.
Breaking: we release a fully synthetic generalist dataset for pretraining, SYNTH and two new SOTA reasoning models exclusively trained on it. Despite having seen only 200 billion tokens, Baguettotron is currently best-in-class in its size range. pleias.fr/blog/blogsyn...
November 10, 2025 at 5:34 PM
who said you couldn’t get a job by asking really good questions
Thrilled to welcome @fofrAI to Google DeepMind and the AI Studio team as a Senior Prompt Engineer!!
Our generative media models (Veo, Nano Banana, Lyria, etc) are reshaping the industry, such a special moment to make sure everyone can get the most out of them, and more!
November 10, 2025 at 7:01 PM
but is it still learning if you got it from an LLM?
November 10, 2025 at 12:34 PM
sent this to my brother asking, “does this count as wealth redistribution?”
(fun fact: my bro voted for Trump and is also undergoing collapse of the company he’s CEO of due to tariffs)
November 9, 2025 at 9:36 PM
ok now i’m just curious
sex workers who make their money on the internet often hang out being cool people on the internet to generate traffic on a sfw account.
if you have zero sex workers as mutuals it means you're using the platform you're on like linkedin and checking their resumes in advance
one comment on the Talarico Instagram non scandal: sex workers are allowed to have lives and interests outside of work, too, including politics.
feel like bsky is preaching to the choir in that regard, but
November 9, 2025 at 9:09 PM
The town of German, NY elected 2 positions on write-in ballots alone
1. Superintendent of Highways
2. Town Justice
apparently no one ran
November 9, 2025 at 9:06 PM
social media is RL on humans
It's interesting that RLHF'd LLMs and influencers talk the same way. Perhaps through the evolution of clickbait, we'd already found the local maximum of attention grabbing
November 9, 2025 at 8:24 PM
most engineers probably don’t understand the extent to which “tech debt” works as an abstraction
there’s absolutely “good tech debt”, and it’s not always obvious which you’re dealing with
if you’re greenfielding an app, you should expect to launch with some amount of tech debt
November 9, 2025 at 5:20 PM
why has nobody bothered to ask the hard questions that matter??
like, how can we appear to address income inequality while actually shifting even more wealth to the ultra rich?
November 9, 2025 at 2:47 PM
this is true
The pope: “you should probably be a good person”
Marc Andreessen: “this is an attack on me and everything I stand for”
November 9, 2025 at 2:25 PM
Polaris Alpha, believed to be GPT-5.1 non-reasoning, scores just below Sonnet 4.5 on HLE (unofficial run)
There will be a reasoning version too, and OpenAI excels at RL & post training, so I have high expectations for it
also leaked: Nov 24 release date
November 9, 2025 at 2:18 PM
idk is a 50 year mortgage even worth it?
November 8, 2025 at 10:45 PM
kimi-writer: an agent that writes long-form fiction using K2-Thinking
it writes files using tools, auto-compacts context when it grows too big
seems like the convergence of high-quality writing and agency
github.com/Doriandarko/...
GitHub - Doriandarko/kimi-writer: AI writing agent powered by kimi-k2-thinking - autonomously creates novels and stories with deep reasoning
github.com
November 8, 2025 at 10:01 PM
Reposted by Tim Kellogg
An increasingly likely future is that China reaps the benefits of the world's most advanced AI systems
Trained on Chinese chips
Using free Chinese renewable energy
Because the American right turned its back on low-cost clean energy and free trade
And the American left turned its back on AI
Did you know a former White House climate advisor flipped a GOP stronghold in Virginia by running entirely on putting a stop to more AI data centers?
For @heatmap.news I profiled John McAuliff and a campaign that will be a roadmap for all future anti-AI politicians moving forward.
This Virginia Election Was a Warning for Data Centers
John McAuliff ran his campaign almost entirely on data centers — and won.
heatmap.news
November 8, 2025 at 3:46 PM
GPT-5-codex-mini
Almost same performance as GPT-5-codex on high, but 4x faster and without pesky things like warm personality
www.neowin.net/amp/openai-i...
November 8, 2025 at 4:46 PM