[copilot-cli-research] Copilot CLI Deep Research - 2026-05-02 #29682
Closed
Replies: 1 comment
-
|
This discussion has been marked as outdated by Copilot CLI Deep Research Agent. A newer discussion is available at Discussion #29874. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Analysis Date: 2026-05-02
Repository: github/gh-aw
Scope: 207 total workflows, ~116 using Copilot engine (89 simple form + 22 + 5 object form; ~56%)
Previous Run: §25134300030 (2026-04-29)
📊 Executive Summary
Research Topic: Copilot CLI Optimization Opportunities — 5th Trend-Tracked Analysis
Key Findings: (1)
startup-timeout&tool-timeoutremain at 0% for 12+ consecutive runs — the most persistent critical gap; (2)max-continuationsstill near-zero despite being Copilot-exclusive; (3) 5 custom agent files are defined but never used; (4)mcp-scriptsusage has fluctuated between runs; (5) over half of all workflows now use Copilot, but engine.bare and engine.harness remain completely unused.Primary Recommendation: Enable
tools.startup-timeoutandtools.timeoutin long-running workflows — these provide cheap, zero-effort resilience against MCP server failures and runaway tool executions.Copilot remains the dominant engine at ~56% of workflows (up from 43% in the previous simple-count methodology). The repository has grown significantly (207 total workflows, +2 from last run), and safe-outputs adoption is strong. However, several Copilot-exclusive capabilities remain completely untapped after 12+ analysis cycles, suggesting documentation gaps or lack of visibility rather than intentional avoidance.
The most impactful untouched feature continues to be tool and startup timeouts — they require a single configuration line and protect against the most common class of workflow hangs. The second most impactful gap is
max-continuationsfor complex long-running tasks that currently fail when the agent runs out of turns.Critical Findings
🔴 High Priority Issues
1. Zero
startup-timeoutUsage (12th Consecutive Run)Not a single Copilot workflow uses
tools.startup-timeoutto guard against MCP server initialization failures. When an MCP server fails to start, the job hangs until thetimeout-minutesdeadline, wasting compute and runner time on every failure.2. Zero
tool-timeoutUsage (12th Consecutive Run)Similarly, no workflow sets
tools.timeoutto cap individual tool call duration. This means a single slow tool call (e.g., a network-fetching MCP tool under degraded conditions) can exhaust the entire job timeout.3.
max-continuationsNear-Zero (Copilot-Exclusive Feature)Only ~2 workflows use
max-continuationsdespite it being the only way to handle tasks that require more work than a single agent session allows. Many complex daily/weekly workflows (code metrics, security analysis, architecture review) could benefit from autopilot multi-run mode.🟡 Medium Priority Opportunities
4. 5 Custom Agent Files Defined but Never Deployed
.github/agents/contains 11 agent files; 6 are being used (awf,technical-doc-writer,contribution-checker,agentic-workflows,adr-writer,developer.instructions), but 5 remain completely idle:grumpy-reviewer.agent.mdw3c-specification-writer.agent.mdcreate-safe-output-type.agent.mdcustom-engine-implementation.agent.mdinteractive-agent-designer.agent.md5.
mcp-scriptsAdoption UnstablePrevious runs showed 6 workflows using
mcp-scripts; this run counted 0 using simple^mcp-scripts:. This suggests count methodology differences. The feature is powerful for inline custom tools but underutilized relative to its capabilities.6. ~37 Workflows Without
strict: trueAmong copilot workflows, 37/116 (~32%) lack
strict: true. While not all workflows need it, those that use GitHub tools or handle PR/issue content benefit from the additional security controls.View Full Analysis
1️⃣ Current State Analysis
View Copilot CLI Capabilities Inventory
Copilot CLI Capabilities Inventory
Engine Configuration (
engine:object)engine.id: copilotengine.versionengine.modelengine.agent.github/agents/engine.argsengine.envengine.bare--no-custom-instructions)engine.harnessengine.api-targetengine.mcp.session-timeoutengine.mcp.tool-timeoutExecution & Autonomy
max-continuationssandbox.agent: awfsandbox.agent: srtTool & MCP Configuration
tools.startup-timeouttools.timeoutmcp-scriptstools.cache-memorySafety & Quality
strict: truefeatures.copilot-requestsnetwork.allowedView Usage Statistics
Usage Statistics (Current Run)
engine: copilot= 89, plusid: copilotobject form = 22+)Most Used Configurations:
timeout-minutes: 30— 23 workflows (most common)timeout-minutes: 20— 20 workflowstools.github mode: gh-proxy— 56 workflowstoolsets: [default]— 14 workflows (over-provisioned)imports— 70 workflowsTimeout Distribution:
2️⃣ Feature Usage Matrix
--add-dir,--autopilot,--no-custom-instructions,--disable-builtin-mcps3️⃣ Missed Opportunities
View High Priority Opportunities
🔴 High Priority
Opportunity 1: MCP Tool & Startup Timeouts
tools.startup-timeoutcaps MCP server initialization time;tools.timeoutcaps individual tool call durationtimeout-minutesexpires, wasting minutes of runner time and causing false workflow failuresgithub:,brave:,playwright:, etc.) — that's nearly all copilot workflowsOpportunity 2:
max-continuationsfor Complex Long Tasksarchitecture-guardian.md,security-review.md,daily-code-metrics.md,copilot-opt.md, any workflow withtimeout-minutes: 45+View Medium Priority Opportunities
🟡 Medium Priority
Opportunity 3: Deploy Unused Custom Agent Files
5 custom agent files in
.github/agents/have never been referenced by any workflow:grumpy-reviewer.agent.mdw3c-specification-writer.agent.mdcreate-safe-output-type.agent.mdcustom-engine-implementation.agent.mdinteractive-agent-designer.agent.mdThese represent significant invested work with zero return. Either create workflows to use them or prune them to reduce maintenance burden.
Opportunity 4: Missing
strict: trueon 37 WorkflowsWorkflows accessing repository content, PRs, or issues without
strict: trueare more vulnerable to prompt injection. Recommend addingstrict: trueto all workflows that:tools.githubwith write-capable toolsetssafe-outputsthat can create/modify issues/PRsKey workflows currently missing strict mode:
agent-performance-analyzer.md,breaking-change-checker.md,code-scanning-fixer.md,dead-code-remover.mdOpportunity 5:
features.copilot-requests: truefor All Copilot WorkflowsOnly ~33% of Copilot workflows track token consumption via
features.copilot-requests: true. This metric is essential for cost attribution and understanding which workflows consume the most Copilot resources. Should be added to all Copilot workflows as a baseline.Opportunity 6: GitHub Toolset Scoping Improvements
Many workflows use
toolsets: [default]which grants broad GitHub access. Workflows should use the minimum required toolset:toolsets: [issues]toolsets: [pull_requests]toolsets: [repos]toolsets: [issues](safe-outputs handles writes)Overly broad toolsets increase the blast radius of a compromised workflow.
View Low Priority Opportunities
🟢 Low Priority
Opportunity 7: Engine Version Pinning for Critical Workflows
Production workflows (
copilot-opt.md,security-review.md, daily analysis workflows) could benefit from pinning the Copilot CLI version to ensure reproducible behavior after CLI updates.Opportunity 8: Custom Harness Script (
engine.harness)For power users:
engine.harnessallows replacing the built-in Node.js harness with a custom script. Zero usage currently. Could be valuable for:Opportunity 9: Two Missing Timeouts
2 workflows have no
timeout-minutesset at all. Every workflow should set an explicit timeout:4️⃣ Specific Workflow Recommendations
View Workflow-Specific Recommendations
architecture-guardian.mdtimeout-minutes: 20, nomax-continuationsmax-continuations: 2for deep analysis runs, addtools.startup-timeout: 30scopilot-opt.mdtimeout-minutes: 30features.copilot-requests: true, addtools.timeout: 3msecurity-review.mdtools.startup-timeouttools.startup-timeout: 45s, addmax-continuations: 2daily-code-metrics.mdstrict: true,features.copilot-requests: true,tools.startup-timeout: 30s5️⃣ Trends & Insights
View Historical Trends (5 Runs)
startup-timeouttool-timeoutmax-continuationsmcp-scriptsengine.bareengine.modelcache-memorysandbox.awf*cache-memory count varies by counting methodology (direct
cache-memory:vs. all forms)Key Trend:
startup-timeoutandtool-timeoutare now the longest-standing unaddressed gap — 12 consecutive analysis cycles with 0% adoption. This is a clear signal for a targeted nudge or documentation improvement.Positive Trend:
mcp-scriptsadoption jumped from 1 → 6 between runs 2 and 3, suggesting word spread.cache-memoryandrepo-memoryboth growing.Concerning: 5 custom agent files remain unused across all 5 tracked runs.
6️⃣ Best Practice Guidelines
Always set tool timeouts: Add
tools.startup-timeout: 30sandtools.timeout: 2mto every workflow using MCP servers. This is the single highest-ROI configuration change.Match toolsets to actual needs: Use
toolsets: [issues]instead oftoolsets: [default]when you only need issue access. Principle of least privilege applies to GitHub tool access too.Add
features.copilot-requests: trueto all Copilot workflows: Token visibility is free — there's no reason not to enable it on every workflow.Use
max-continuationsfor complex tasks: Any workflow withtimeout-minutes: 45+that doesn't usemax-continuationsis likely silently truncating work. Addmax-continuations: 2-3to allow completion.Prune or activate dormant agent files: If a custom agent file hasn't been used after 5+ analysis cycles, either create a workflow to use it or remove it to reduce cognitive overhead.
Enable
strict: trueon all workflows that touch untrusted content: Default to secure; opt out only when there's a specific reason.7️⃣ Action Items
Immediate Actions (this week):
tools.startup-timeout: 30sto the 10 most critical Copilot workflowsfeatures.copilot-requests: trueto all workflows missing it (~78 workflows)Short-term (this month):
max-continuations: 2-3to complex analysis workflows (architecture-guardian, security-review, copilot-opt)strict: trueand add where appropriate[default]with minimal required toolsetsLong-term (this quarter):
engine.harnessfor workflows needing custom execution behaviorengine.versionpinning for critical production workflowsmcp-scriptsfor custom read-only API integrationsView Supporting Evidence & Methodology
📚 References
.github/agents/(11 files)/tmp/gh-aw/repo-memory/default/copilot-research-notes.mdResearch Methodology
This analysis was conducted by:
.github/workflows/*.mdfiles usinggrepfor engine declarations (engine: copilotandid: copilot)pkg/workflow/copilot_engine*.gofiles for available CLI featuresdocs/src/content/docs/reference/engines.mdfor documented featuresNote on counts: Simple-form count (
engine: copilot) = 89; object-form count (id: copilot) = 22; total = ~116. Previous runs also counted default-engine workflows (no engine declaration) bringing totals to ~110-116. Some feature counts (bare, model, mcp-scripts) are best-effort grep-based and may vary ±2.Generated by Copilot CLI Deep Research Agent (Run: §25243992723)
Beta Was this translation helpful? Give feedback.
All reactions