Writes at https://olickel.com
Here's a slice of a 4 HOUR run (~1 second per minute) with not much more than 'keep going' from me every 90 minutes or so.
moonshotai.github.io/Kimi-K2/
Here's a slice of a 4 HOUR run (~1 second per minute) with not much more than 'keep going' from me every 90 minutes or so.
moonshotai.github.io/Kimi-K2/
The key problems for modern LLM application design that get often overlooked (I think) are:
• Streaming outputs and partial parsing
• Context organization and management (I don't mean summarising at 90%)
The key problems for modern LLM application design that get often overlooked (I think) are:
• Streaming outputs and partial parsing
• Context organization and management (I don't mean summarising at 90%)
Most flows today treat NL as reasoning, code as execution, and structured data as an extraction method. There might be problems with this approach.
Most flows today treat NL as reasoning, code as execution, and structured data as an extraction method. There might be problems with this approach.
Verified on Qwen 3 - a30b (below)
Lots of interesting takeaways from the Random Rewards paper. NOT that RL is dead, but honestly far more interesting than that!
Verified on Qwen 3 - a30b (below)
Lots of interesting takeaways from the Random Rewards paper. NOT that RL is dead, but honestly far more interesting than that!
Managing context is key. Long tool calls can be killed with just one bad call that dumps a bunch of text into context. Both cursor models forgot after a while, and barely made it.
Managing context is key. Long tool calls can be killed with just one bad call that dumps a bunch of text into context. Both cursor models forgot after a while, and barely made it.
I was just trying to talk to Opus - definitely no jailbreaks. This model is something different. Definitely creative.
I was just trying to talk to Opus - definitely no jailbreaks. This model is something different. Definitely creative.
I'll open source or share the link once I can clean it up - still using my keys, drop email/twitter in comments
Sonnet looking through the thing 👇
I'll open source or share the link once I can clean it up - still using my keys, drop email/twitter in comments
Sonnet looking through the thing 👇
PDF processing in both models don't really seem multi-modal. Claude sometimes has glaucoma.
PDF processing in both models don't really seem multi-modal. Claude sometimes has glaucoma.
None of them got it right (or even identified the right part) even after I cut it down to 10 pages.
Eventually -
None of them got it right (or even identified the right part) even after I cut it down to 10 pages.
Eventually -
Everything I know.
Enjoy.
Everything I know.
Enjoy.
Took an hour or two and made something that can push notes and outputs from Lumentis straight to Notion
Been writing more with Cursor, and pushing it to Notion
Took an hour or two and made something that can push notes and outputs from Lumentis straight to Notion
Been writing more with Cursor, and pushing it to Notion