Writes at https://olickel.com
Here's a slice of a 4 HOUR run (~1 second per minute) with not much more than 'keep going' from me every 90 minutes or so.
moonshotai.github.io/Kimi-K2/
Here's a slice of a 4 HOUR run (~1 second per minute) with not much more than 'keep going' from me every 90 minutes or so.
moonshotai.github.io/Kimi-K2/
blog.cloudflare.com/introducing...
blog.cloudflare.com/introducing...
Here's Gemini Cli (with Sonnet 4) vs Claude Code (with and without subagents) fixing the same bug from the same prompt in the gemini-cli codebase:
www.notion.so/southbridge...
Here's Gemini Cli (with Sonnet 4) vs Claude Code (with and without subagents) fixing the same bug from the same prompt in the gemini-cli codebase:
www.notion.so/southbridge...
Holy shit there's a lot in there. Claude Code is NOT just Claude in a loop - there's so much to learn from.
www.notion.so/southbridge...
Holy shit there's a lot in there. Claude Code is NOT just Claude in a loop - there's so much to learn from.
www.notion.so/southbridge...
The key problems for modern LLM application design that get often overlooked (I think) are:
• Streaming outputs and partial parsing
• Context organization and management (I don't mean summarising at 90%)
The key problems for modern LLM application design that get often overlooked (I think) are:
• Streaming outputs and partial parsing
• Context organization and management (I don't mean summarising at 90%)
Verified on Qwen 3 - a30b (below)
Lots of interesting takeaways from the Random Rewards paper. NOT that RL is dead, but honestly far more interesting than that!
Verified on Qwen 3 - a30b (below)
Lots of interesting takeaways from the Random Rewards paper. NOT that RL is dead, but honestly far more interesting than that!
The 4 series of models are good. REALLY GOOD. They're one-shotting complex series of 100s of tool calls without issue, on things Sonnet 3.7 failed.
The 4 series of models are good. REALLY GOOD. They're one-shotting complex series of 100s of tool calls without issue, on things Sonnet 3.7 failed.
I was just trying to talk to Opus - definitely no jailbreaks. This model is something different. Definitely creative.
I was just trying to talk to Opus - definitely no jailbreaks. This model is something different. Definitely creative.
I'll open source or share the link once I can clean it up - still using my keys, drop email/twitter in comments
Sonnet looking through the thing 👇
I'll open source or share the link once I can clean it up - still using my keys, drop email/twitter in comments
Sonnet looking through the thing 👇
None of them got it right (or even identified the right part) even after I cut it down to 10 pages.
Eventually -
None of them got it right (or even identified the right part) even after I cut it down to 10 pages.
Eventually -
A lot of food analogies in the post. I do not recommend reading this on an empty stomach
A lot of food analogies in the post. I do not recommend reading this on an empty stomach
Everything I know.
Enjoy.
Everything I know.
Enjoy.
Improvement over SoTA: trying to solve LLM consistency over very long outputs, and output adherence to things like timestamps - where a 500ms change is noticeable.
github.com/hrishioa/ipgu
Improvement over SoTA: trying to solve LLM consistency over very long outputs, and output adherence to things like timestamps - where a 500ms change is noticeable.
github.com/hrishioa/ipgu
AI with side effects
AI with side effects
Took an hour or two and made something that can push notes and outputs from Lumentis straight to Notion
Been writing more with Cursor, and pushing it to Notion
Took an hour or two and made something that can push notes and outputs from Lumentis straight to Notion
Been writing more with Cursor, and pushing it to Notion
Is connecting with the self really just feeling the internal latent space?
Is connecting with the self really just feeling the internal latent space?