Tim Kellogg
banner
timkellogg.me
Tim Kellogg
@timkellogg.me
AI Architect | North Carolina | AI/ML, IoT, science

WARNING: I talk about kids sometimes
on the next page it says all chat bots must answer to the name “big brother”
November 11, 2025 at 9:18 PM
i tried doing a startup doing exactly this with KGs, it did not pan out
November 11, 2025 at 4:07 PM
ya that’s where i’m at too. it feels strange watching someone use a sub-par mode
November 11, 2025 at 4:01 PM
my suspicion is that Google doesn’t do this and that they might be the only ones that don’t
November 11, 2025 at 3:19 PM
the longer i think about it, the more i suspect they’re doing something at an even higher level

like instead of dynamic batch sizing, maybe they do constant and just have very smart load balancers that keep load saturated

probably balancing training & serving utilization
November 11, 2025 at 3:18 PM
it didn’t occur to me until now, but monad is in the range of HRM but generalized
November 11, 2025 at 2:37 PM
honestly surprised he didn’t do that years ago
November 11, 2025 at 1:51 PM
ya that was my thought too
November 11, 2025 at 12:32 PM
maybe, but the even “twice” makes me think it was something that more directly adds to 2x perf
November 11, 2025 at 12:31 PM
so sweet
November 11, 2025 at 3:27 AM
@dorialexander.bsky.social i wish you blessings in the form of billions of euros in funding
November 11, 2025 at 2:43 AM
i’m surprised! i expected them to train in fp32, but no, they went with a legit bf16
November 11, 2025 at 2:41 AM
while being the most French model yet, they had to rationalize why it wasn’t trained on French

but fr imagine being able to do ablations on THE ENTIRE end-to-end training process. you’d learn so much
November 11, 2025 at 2:36 AM
lol
November 10, 2025 at 11:50 PM
Google will also budget rewrites into their timelines, or so i've heard
November 10, 2025 at 10:31 PM