corbt.bsky.social
@corbt.bsky.social
Does feel like Gemini models are reliably better at needle in a haystack style tasks.
May 7, 2025 at 7:39 PM
Yes, a wonderful marketing strategy for Tesla imo
April 26, 2025 at 5:25 AM
Surprising convergence of abilities in model releases over the past 6 months
April 19, 2025 at 7:33 AM
o3 and o4-mini seem like evidence in favor of the flywheel hypothesis, though we're certainly not very far down the takeoff trajectory yet.
April 18, 2025 at 11:35 PM
Yes
April 15, 2025 at 7:37 AM
Even after your explanation, still unclear to me how an agent that another agent can offload work to differs from a tool that an agent can call?
April 9, 2025 at 5:59 PM
If anything, you are too active on this platform.
April 5, 2025 at 12:33 AM
This is awesome! We're working on something similar for web use agents!
April 5, 2025 at 12:29 AM
😂
April 4, 2025 at 8:50 PM