Building with Claude 4: The Parallel Agent Workflow
Claude 4's extended thinking and tool use capabilities change what parallel agents can accomplish.
Claude 4 — specifically Claude Opus 4 — represents a meaningful capability jump for long-horizon agentic tasks. Extended thinking, better tool use reliability, stronger code generation. For parallel agent workflows, this matters because the limiting factor was always agent capability per session, not the ability to run many sessions. With Claude 4, the per-agent output quality is higher, which multiplies across a fleet.
What Claude 4 Changes for Agents
The most important improvements for agentic workflows:
- Extended thinking: Claude 4 can reason through complex problems before generating code. This reduces the "plausible but wrong" outputs that required re-prompting.
- Tool use reliability: Agents use bash, read files, write code, run tests. Claude 4's tool use is more reliable — fewer hallucinated tool calls, better error recovery.
- Context utilization: Claude 4 makes better use of its context window, meaning agents can hold more of your codebase in mind when making changes.
- Instruction following: Complex, multi-step task descriptions are executed more faithfully. You can give agents richer specifications and expect better compliance.
Structuring Tasks for Claude 4
With a more capable model, the bottleneck shifts from agent capability to task specification. Here's how I structure tasks for Claude 4 agents:
- Give context: Include file paths, relevant existing code patterns, and the overall architecture. Claude 4 uses this effectively.
- Specify constraints: What should the agent NOT do? Don't modify these files, don't add new dependencies, maintain this interface. Explicit constraints prevent scope creep.
- Define done: What does success look like? "All tests passing" or "The endpoint returns 200 for these inputs" gives the agent a clear target to verify against.
Fleet Configuration for Claude 4
In your amux session config, you can specify which model each agent uses. For complex architectural tasks, use Opus 4. For high-volume, pattern-following work (test generation, documentation, boilerplate), use Sonnet 4 or Haiku to keep costs down.
# Complex tasks: Opus 4
amux register architect --dir ~/project --model claude-opus-4 --yolo
# High-volume tasks: Sonnet for balance
amux register test-writer --dir ~/project --model claude-sonnet-4 --yolo
The Model Mix Strategy
The most cost-effective fleet uses mixed models: 1-2 Opus 4 agents for complex reasoning and architecture decisions, 5-10 Sonnet 4 agents for implementation, and Haiku agents for search, triage, and documentation. This gives you frontier capability where it matters and lower cost where it doesn't.
Get started with amux
Run dozens of Claude Code agents in parallel. Python 3 + tmux. Open source.
git clone https://github.com/mixpeek/amux && cd amux && ./install.sh
amux register myproject --dir ~/Dev/myproject --yolo
amux start myproject
amux serve # → https://localhost:8822View on GitHub