Skip to main content
redditJanuary 30, 2026

Copilot Subagents: Parallel Orchestration Demo

Custom subagents with explicit model assignments and parallel execution now work in VS Code Insiders—conserving 95% of context window by delegating tasks to specialized agents.

Summary

A VS Code Insiders user demonstrates a multi-agent orchestration system using GitHub Copilot's new custom subagent features. The setup uses different models for different roles—Sonnet 4.5 for orchestration, Gemini 3 Flash for exploration, GPT 5.2 for research—running in parallel while preserving main context.

Key Points from OP

  • Parallel subagent execution now works in VS Code Insiders (Jan 2026 build)
  • Tasks that consumed 80K tokens in a single agent now use under 4K via delegation
  • Main orchestrator (Atlas) only delegates and synthesizes—no direct work
  • Subagents are defined via [agent-name].agent.md files with model assignments

Agent Architecture

AgentModelRole
AtlasSonnet 4.5Orchestrator—delegates and synthesizes
PrometheusGPT 5.2 HighDeep research and planning, writes to plan directories
OracleGPT 5.2Research and analysis
SisyphusSonnet 4.5Task execution from plans
ExplorerGemini 3 FlashFile discovery, web fetches
Code ReviewGPT 5.2Autonomous code review
Frontend EngineerGemini 3 ProUI/UX specialist

Notable Comments

"If anyone is wondering on the technicalities, they are just documentation files, in markdown (.md) format, which Copilot recognizes and auto-loads them on every message, if you name them agent-name.agent.md. The sub-agents can take Copilot to the next level in regards to its context-window."

— u/SourceCodeplz (2 points)

"What's interesting to me is not the 'parallel execution' aspect but more the context size optimization. Big functional prompts or zero-to-POC prompts are probably going to benefit the most from the system. But what about 'fix that error' prompts?"

— u/WSATX (1 point)

Model Assignment Caveat

Several commenters report that model assignments in subagent definitions may be ignored—subagents appear to use the main chat's model regardless of what's defined in their .agent.md file. One commenter tested with reasoningEffort: xhigh (only supported by GPT 5.2 Codex) and found subagents ran without errors even when assigned GPT 5-mini, suggesting the model setting was discarded.

Discussion Takeaways

  • Context conservation is the primary value—not just parallelism
  • Subagent cost structure unclear: some claim subagents don't consume additional premium requests
  • Enabling requires: "chat.customAgentInSubagent.enabled": true in VS Code settings
  • Nested subagents (sub-sub-agents) not yet supported
  • Features currently Insiders-only, expected in public VS Code within weeks

Resources

Connections