Thread-Based Engineering: How to Ship Like Boris Cherny
Thread-based engineering provides a mental framework for measuring improvement with AI agents—scale by running more threads, longer threads, thicker threads, and fewer human checkpoints.
Key Takeaways
How do you know you're improving as an agentic engineer? IndyDevDan proposes thread-based engineering—a framework for thinking about your work with AI agents as discrete "threads" of work where you show up at the prompt and the review.
The Base Thread
A thread is a unit of engineering work over time driven by you and your agents. Every thread has two mandatory nodes:
- You prompt or plan at the beginning
- Your agent works through tool calls
- You review or validate at the end
Tool calls roughly equal impact. Pre-2023, engineers were the tool calls. Now agents handle the middle while you show up at the beginning and end.
The Six Thread Types
| Thread | Name | Description |
|---|---|---|
| Base | Single thread | One prompt → agent work → one review |
| P | Parallel | Multiple threads running simultaneously (Boris runs 5-10 Claude Codes in parallel) |
| C | Chained | Multi-phase work with intentional checkpoints for high-risk production work |
| F | Fusion | Same prompt to multiple agents, then aggregate the best results |
| B | Big/Nested | Meta-threads where agents prompt other agents (sub-agents, orchestrators) |
| L | Long | High-autonomy, extended duration work—hundreds of tool calls over hours |
There's also a hidden seventh: the Z-thread (zero touch)—maximum trust where you skip the review step entirely.
The Framework
flowchart LR
subgraph Base["Base Thread"]
P1[Prompt] --> TC[Tool Calls] --> R1[Review]
end
subgraph Scale["Scaling Dimensions"]
More[More Threads]
Longer[Longer Threads]
Thicker[Thicker Threads]
Fewer[Fewer Checkpoints]
end
Base --> Scale
Four Ways to Improve
- Run more threads — Spin up multiple terminals, work trees, or cloud instances
- Run longer threads — Better prompting and context engineering enable higher autonomy
- Run thicker threads — Nest threads inside threads (sub-agents, orchestrators)
- Reduce checkpoints — Build systems you trust so you can skip human-in-the-loop reviews
Boris Cherny's Setup (Referenced)
- 5 Claude Codes in terminal (tabs 1-5)
- 5-10 additional Claude Codes in the web interface for background work
- Uses stop hooks for very long-running tasks
- System notifications when Claude needs input
- Always uses Opus 4.5
Notable Quotes
"Tool calls roughly equal impact assuming you're prompting something useful. Pre-2023, you and I were the tool calls."
"The engineer kicking off five agents in five separate terminals is getting more done than the one running a single agent."
"If you don't measure it, you will not be able to improve it."
Connections
- ralph-wiggum-technique-guide — The Ralph Wiggum pattern mentioned throughout is a specific implementation of thread-based engineering (loop over agent for deterministic work)
- boris-cherny-on-what-grew-his-career-and-building-at-anthropic — Boris's actual interview where he discusses his Claude Code philosophy and the generalist engineering approach
- building-effective-agents — Anthropic's guide covers similar patterns: parallelization maps to P-threads, orchestrator-workers maps to B-threads