...how much are agents contributing versus executing grunt work for an expert who could have written the code?
Maybe "sophisticated code monkey" is the right abstraction.
...how much are agents contributing versus executing grunt work for an expert who could have written the code?
Maybe "sophisticated code monkey" is the right abstraction.
- Architectural decisions (what the spec says)
- Instruction authoring (translating domain knowledge)
- Quality gates (recognizing unsound designs)
Domain expertise doesn't die. It migrates up the stack.
- Architectural decisions (what the spec says)
- Instruction authoring (translating domain knowledge)
- Quality gates (recognizing unsound designs)
Domain expertise doesn't die. It migrates up the stack.
You need a domain expert writing instructions, checking architecture, steering toward compliance.
You need a domain expert writing instructions, checking architecture, steering toward compliance.
steve-yegge.medium.com/welcome-to-gas-town-4f25ee16dd04
steve-yegge.medium.com/welcome-to-gas-town-4f25ee16dd04
What worked: hierarchy. Planner/Worker/Judge. Workers don't coordinate - all coordination flows through hierarchy. Git worktrees for isolation.
What worked: hierarchy. Planner/Worker/Judge. Workers don't coordinate - all coordination flows through hierarchy. Git worktrees for isolation.
Anthropic Jan 2026: "Multi-agent should address constraints single agent cannot overcome."
Start with one good agent. Add more only when you can articulate exactly why.
Anthropic Jan 2026: "Multi-agent should address constraints single agent cannot overcome."
Start with one good agent. Add more only when you can articulate exactly why.
Cursor's FastRender built a browser with 100s of agents. Equal-status with locking failed (20 agents slowed to throughput of 2-3).
Planner/Worker/Judge succeeded. Workers isolated - all coordination through hierarchy.
Cursor's FastRender built a browser with 100s of agents. Equal-status with locking failed (20 agents slowed to throughput of 2-3).
Planner/Worker/Judge succeeded. Workers isolated - all coordination through hierarchy.
Multi-agent helps when base performance is low and agents can complement each other. When one agent is already decent, coordination overhead exceeds benefit.
Multi-agent helps when base performance is low and agents can complement each other. When one agent is already decent, coordination overhead exceeds benefit.
1. Flat topology (no hierarchy, agents as equals)
2. Noisy chatter (hallucination loops between peers)
3. Open-loop execution (no assurance plane checking outputs)
Sound familiar? This describes most multi-agent demos.
1. Flat topology (no hierarchy, agents as equals)
2. Noisy chatter (hallucination loops between peers)
3. Open-loop execution (no assurance plane checking outputs)
Sound familiar? This describes most multi-agent demos.
The gap between research and deployment is shrinking fast. Expect more soon.
The gap between research and deployment is shrinking fast. Expect more soon.
- MAGMA: orthogonal graphs with policy-guided traversal
- AgeMem: memory ops as tool-based actions
- Aeon: "memory as OS resource" with sub-1ms retrieval
Common theme: structure matters more than scale.
- MAGMA: orthogonal graphs with policy-guided traversal
- AgeMem: memory ops as tool-based actions
- Aeon: "memory as OS resource" with sub-1ms retrieval
Common theme: structure matters more than scale.
The agent doesn't just remember what happened - it extracts patterns that apply to new situations. Transcends situational constraints.
Still mostly research territory.
The agent doesn't just remember what happened - it extracts patterns that apply to new situations. Transcends situational constraints.
Still mostly research territory.
Instead of storing "user asked X, I replied Y", the agent generates insights: "user prefers brief responses" or "this topic connects to earlier thread."
More value per token stored.
Instead of storing "user asked X, I replied Y", the agent generates insights: "user prefers brief responses" or "this topic connects to earlier thread."
More value per token stored.
Problem: entangles temporal, causal, and entity information in flat vectors. "Vector haze" - you get semantically similar facts that are episodically disconnected.
Most current RAG systems live here.
Problem: entangles temporal, causal, and entity information in flat vectors. "Vector haze" - you get semantically similar facts that are episodically disconnected.
Most current RAG systems live here.