feat(cockpit): chat/subagents renders inline subagent cards (real subgraph + aimock e2e) by blove · Pull Request #718 · cacheplane/angular-agent-framework

blove · 2026-06-20T02:17:37Z

Summary

Follow-up to #711 (inline persistent subagent cards). The cockpit/chat/subagents demo used a flat inline task tool, so agent.subagents() never populated and it showed a generic "task" chip instead of a subagent card. This aligns it with the working examples/chat pattern so it renders the inline persistent chat-subagent-card — on both the live LangGraph runtime and under aimock e2e — and adds the now-possible card assertion to examples/chat too.

What changed

Backend (cockpit/chat/subagents/python): the task tool now invokes a real compiled subagent subgraph (subagent_type is a required Literal, so it reaches the tool-call args the SubagentTracker registers on, and the child subgraph's tools:<id> namespace matches it). Each subagent is a single deterministic LLM call (no within-subagent tool loop) — see below.
Frontend: subagentToolNames: ['task'] on provideAgent; removed the now-redundant active-only <chat-subagents> sidebar tray (the inline persistent cards supersede it); corrected the pipeline note to research/booking/itinerary.
Fixtures: re-recorded c-subagents.json against the new graph.
e2e: cockpit/chat/subagents and examples/chat/research-subagent now assert the inline chat-subagent-card (cockpit asserts exactly 3, persisting).

Why single-call subagents

The first attempt kept the per-subagent tool loop. Under aimock replay the run errored with 404 no_fixture_match: a nested subagent's tool-loop rounds present local discriminators (turnIndex/hasToolResult) that the recorder captured against the global conversation, so they don't match on replay. Collapsing each subagent to one LLM call gives every subagent request a unique, stable discriminator (its role task_description), which aimock matches deterministically. Within-subagent tool calling is the dedicated tool-calls cap's concern; this cap demonstrates subagent orchestration + the card.

Verification

cockpit-chat-subagents e2e — green (asserts chat-subagent-card ×3, persists).
examples-chat e2e — 42 passed (incl. research-subagent card assertion).
Both apps build green.
Live smoke (LangGraph, real key): 3 subagents → 3 cards, 0 duplicate ticks, 0 NG0956, all collapsed/complete (research/booking/itinerary), no leftover task chips.

Spec: docs/superpowers/specs/2026-06-19-cockpit-subagents-subgraph-design.md · Plan: docs/superpowers/plans/2026-06-19-cockpit-subagents-subgraph.md

🤖 Generated with Claude Code

…graph card parity)

…subagent_type) Replace the flat inline _run_subagent() with a compiled parameterized StateGraph invoked by the task tool, so LangGraph nests it under a tools:<call_id> namespace and the SubagentTracker can surface a card. task() now takes subagent_type (Literal, required) so it always reaches the tool-call args the tracker registers on.

…bagents tray Inline persistent subagent cards (via <chat>) now surface each dispatch in conversation, so the active-only sidebar <chat-subagents> panel is removed. Pipeline note corrected to research/booking/itinerary.

…raph (captures subagent_type research/booking/itinerary)

… card e2e Each subagent is now ONE LLM call (no within-subagent tool loop), so its request carries a unique, stable discriminator (the role task_description) — the nested tool-loop rounds couldn't be matched by aimock's turnIndex/ hasToolResult scheme (404 no_fixture_match). Re-recorded the fixture and the c-subagents e2e now asserts the inline chat-subagent-card (3 cards, persists).

…bagent e2e

…nts-subgraph

vercel · 2026-06-20T02:17:43Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
threadplane	Ready	Preview, Comment	Jun 20, 2026 3:23am

github-actions · 2026-06-20T02:18:41Z

Claude finished @blove's task in 2m 28s —— View job

PR Review

Read CLAUDE.md and understand repo guidelines
Read the PR diff and changed files
Review Python backend changes (graph.py)
Review Angular frontend changes (app.config.ts, subagents.component.ts)
Review e2e test changes
Review fixture changes
Post findings

Summary

Solid follow-up to #711. The core diagnosis is correct — the old flat @tool never created a tools:<call_id> namespace, so SubagentTracker had nothing to match and the inline cards never rendered. Compiling a real child StateGraph and renaming role → subagent_type fixes both problems cleanly. The single-call-per-subagent tradeoff to stabilize aimock replay is well-motivated and well-documented.

Correctness observations

task_description vs description in namespace matching

SubagentTracker.matchSubgraphToSubagent (in subagent-tracker.ts) first tries to match a subgraph namespace to a registered tool-call by comparing args['description'] (lines 137, 144) to the child graph's reported description. The task tool args carry task_description, not description, so this path always misses and the code falls through to the positional "first unmatched pending/running" fallback (line 152). For sequential deterministic dispatch this is fine — registration order equals dispatch order. But it means the description-based fast-path is effectively dead for this pattern. No bug, but worth documenting: if a future cap dispatches subagents with the same task_description for different slots, positional matching is the only disambiguator.

_SUBAGENT_PROMPTS.get silent fallback — inline comment posted (minor; a log line is enough).

Count assertion timeout — inline comment posted (cosmetic; safe in practice).

Other notes

Removing ChatSubagentsComponent / ChatSubagentCardComponent from the cockpit imports is correct — the inline card is now composed inside <chat> by default, so the explicit sidebar tray was redundant.
subagentToolNames: ['task'] in app.config.ts is the right toggle. The SubagentTracker's default already includes 'task', but being explicit here makes the cap self-documenting. ✓
Fixture reduction from ~102 to ~26 lines (removing the nested tool-call round-trips) is expected and clean.
_final_text defensively handles both string and list-block content formats — good coverage for streaming edge cases.

Verdict

Approve with minor nits. The two inline suggestions are low-priority (a log line and an explicit timeout). The implementation is correct and the e2e coverage is meaningfully stronger than before.

github-actions

Automated approval: this PR received an intelligent (AI) code review. See the review comments on this PR.

github-actions · 2026-06-20T02:20:50Z

+    @threadplane/langgraph SubagentTracker matches to surface the card."""
+    subagent_type = state["subagent_type"]
+    task_description = state["task_description"]
+    system_prompt = _SUBAGENT_PROMPTS.get(subagent_type, _ITINERARY_PROMPT)


Silent fallback to _ITINERARY_PROMPT for an unrecognised subagent_type (e.g. an LLM hallucination) will produce a confusing itinerary response with no indication anything went wrong. A short print / log line on the else-path would make this much easier to debug in production:

Suggested change

system_prompt = _SUBAGENT_PROMPTS.get(subagent_type, _ITINERARY_PROMPT)

system_prompt = _SUBAGENT_PROMPTS.get(subagent_type)

if system_prompt is None:

print(f"[_subagent_node] unknown subagent_type={subagent_type!r}; falling back to itinerary", flush=True)

system_prompt = _ITINERARY_PROMPT

github-actions · 2026-06-20T02:20:54Z

+  await expect(page.locator('chat-subagent-card')).toHaveCount(3);

  // Final summary text contains an aviation-related phrase from the captured
  // continuation. Loose regex so refactors to the subagent prompts (research/


toHaveCount carries Playwright's default assertion timeout (usually 5 s). Since submitAndWaitForResponse already waits for the agent to go idle, all 3 cards should be in the DOM by now, so this is safe in practice. But making the intent explicit avoids a future head-scratcher if the project timeout ever changes:

Suggested change

// continuation. Loose regex so refactors to the subagent prompts (research/

await expect(page.locator('chat-subagent-card')).toHaveCount(3, { timeout: 15_000 });

…nts-subgraph

github-actions · 2026-06-20T03:05:28Z

Claude finished @blove's task in 3m 18s —— View job

PR Review

Read CLAUDE.md and understand repo guidelines
Read PR diff and changed files
Review Python backend (graph.py)
Review Angular frontend (app.config.ts, subagents.component.ts)
Review e2e tests and fixtures
Post findings

Summary

Solid follow-up. The root cause diagnosis is correct: the old flat @tool never created a tools:<call_id> namespace, so SubagentTracker had nothing to match — hence no cards. Compiling a real child StateGraph and renaming role → subagent_type fixes both registration (isValidSubagentType(args['subagent_type']) now passes) and namespace matching.

Verification of tracker behaviour

Confirmed by reading subagent-tracker.ts:

Registration (registerFromToolCalls, line 74): reads args['subagent_type'] — the new tool arg name. Passes for "research" / "booking" / "itinerary". ✓
Matching (matchSubgraphToSubagent, lines 135–154): the description-based fast-paths read args['description'], which doesn't exist in the new tool args, so they always fall through to the positional fallback (first unmatched pending/running subagent). For sequential deterministic dispatch this is correct. The PR description documents this behaviour accurately.
Card persistence: getSubagents() returns all subagents with status !== 'pending', so completed cards remain in the DOM — the toHaveCount(3) assertion is correctly placed after submitAndWaitForResponse. ✓

Open nits (from previous review, not yet addressed)

Two inline suggestions re-posted against the current commit — both low-priority:

_SUBAGENT_PROMPTS.get silent fallback (graph.py:124) — an unknown subagent_type silently falls back to _ITINERARY_PROMPT; a print on the else-path makes it debuggable.
toHaveCount(3) without explicit timeout (c-subagents.spec.ts:23) — safe in practice since the agent is idle by then, but an explicit { timeout: 15_000 } self-documents intent.

One new minor observation

ChatOpenAI(model="gpt-5-mini", streaming=True) is instantiated fresh on every _subagent_node invocation (graph.py:126). For a demo cap this is fine, but moving it to module-level would avoid recreating the client object on each of the three subagent calls. Not a correctness issue.

Verdict

Implementation is correct. The e2e coverage is meaningfully stronger. Two pre-existing nits remain (inline suggestions on the current commit above).

github-actions · 2026-06-20T03:08:30Z

+    @threadplane/langgraph SubagentTracker matches to surface the card."""
+    subagent_type = state["subagent_type"]
+    task_description = state["task_description"]
+    system_prompt = _SUBAGENT_PROMPTS.get(subagent_type, _ITINERARY_PROMPT)


The silent fallback to _ITINERARY_PROMPT for an unrecognised subagent_type produces a confusing itinerary response with no indication anything went wrong. This was flagged in the previous review pass and is still unaddressed. A one-liner is enough:

Suggested change

system_prompt = _SUBAGENT_PROMPTS.get(subagent_type, _ITINERARY_PROMPT)

system_prompt = _SUBAGENT_PROMPTS.get(subagent_type)

if system_prompt is None:

print(f"[_subagent_node] unknown subagent_type={subagent_type!r}; falling back to itinerary", flush=True)

system_prompt = _ITINERARY_PROMPT

github-actions · 2026-06-20T03:08:33Z

+
+  // One card per subagent dispatched (research/booking/itinerary), no
+  // duplicates — the orchestrator calls task three times in order.
+  await expect(page.locator('chat-subagent-card')).toHaveCount(3);


toHaveCount inherits the project's default assertion timeout. submitAndWaitForResponse already waits for idle, so all 3 cards should be present — but making the intent explicit protects against a future default-timeout change and self-documents the expected wall-clock. This was flagged in the previous review pass and is still unaddressed:

Suggested change

await expect(page.locator('chat-subagent-card')).toHaveCount(3);

await expect(page.locator('chat-subagent-card')).toHaveCount(3, { timeout: 15_000 });

…nts-subgraph

github-actions · 2026-06-20T03:13:11Z

Claude finished @blove's task in 1m 54s —— View job

PR Review

Read CLAUDE.md and repo guidelines
Read the full PR diff and changed files
Check if prior review nits were addressed
Review for correctness, type-safety, and new issues
Post findings

Summary

This synchronize push merges recent main commits into the branch — no substantive changes to the PR implementation. The core work (real subagent_subgraph, subagent_type arg, subagentToolNames: ['task'], fixture re-record) remains correct. The sync introduced no conflicts.

Two outstanding nits (flagged in prior two passes, still unaddressed)

graph.py:124 — silent fallback — _SUBAGENT_PROMPTS.get(subagent_type, _ITINERARY_PROMPT) silently produces an itinerary response for any unrecognised subagent_type. Inline suggestion re-posted: add a print on the else-path.
c-subagents.spec.ts:23 — toHaveCount(3) without explicit timeout — inherits the project default (5 s). Safe now since submitAndWaitForResponse already idles, but a { timeout: 15_000 } self-documents intent. Inline suggestion re-posted.

Both are one-liners. Everything else looks good.

github-actions · 2026-06-20T03:14:57Z

+    @threadplane/langgraph SubagentTracker matches to surface the card."""
+    subagent_type = state["subagent_type"]
+    task_description = state["task_description"]
+    system_prompt = _SUBAGENT_PROMPTS.get(subagent_type, _ITINERARY_PROMPT)


Still unaddressed from the previous two review passes. An unrecognised subagent_type silently produces an itinerary response — add a log line so this is observable when it happens:

Suggested change

system_prompt = _SUBAGENT_PROMPTS.get(subagent_type, _ITINERARY_PROMPT)

system_prompt = _SUBAGENT_PROMPTS.get(subagent_type)

if system_prompt is None:

print(f"[_subagent_node] unknown subagent_type={subagent_type!r}; falling back to itinerary", flush=True)

system_prompt = _ITINERARY_PROMPT

github-actions · 2026-06-20T03:15:00Z

+
+  // One card per subagent dispatched (research/booking/itinerary), no
+  // duplicates — the orchestrator calls task three times in order.
+  await expect(page.locator('chat-subagent-card')).toHaveCount(3);


Still unaddressed from the previous two review passes. toHaveCount inherits the project's default assertion timeout — submitAndWaitForResponse already waits for idle so all 3 cards should be present, but an explicit timeout self-documents intent and insulates the assertion from a future default-timeout change:

Suggested change

await expect(page.locator('chat-subagent-card')).toHaveCount(3);

await expect(page.locator('chat-subagent-card')).toHaveCount(3, { timeout: 15_000 });

blove added 8 commits June 19, 2026 17:20

docs: design for cockpit chat/subagents real-subgraph alignment (lang…

523328c

…graph card parity)

docs: plan for cockpit chat/subagents real-subgraph alignment

3930d0b

test(cockpit): re-record c-subagents aimock fixture for subagent subg…

5b3c392

…raph (captures subagent_type research/booking/itinerary)

test(examples): assert inline persistent subagent card in research-su…

bd11b01

…bagent e2e

Merge remote-tracking branch 'origin/main' into claude/cockpit-subage…

6a1244b

…nts-subgraph

blove enabled auto-merge (squash) June 20, 2026 02:17

github-actions Bot approved these changes Jun 20, 2026

View reviewed changes

github-actions Bot reviewed Jun 20, 2026

View reviewed changes

Merge remote-tracking branch 'origin/main' into claude/cockpit-subage…

4bfb00f

…nts-subgraph

github-actions Bot reviewed Jun 20, 2026

View reviewed changes

Merge remote-tracking branch 'origin/main' into claude/cockpit-subage…

5281780

…nts-subgraph

github-actions Bot reviewed Jun 20, 2026

View reviewed changes

vercel Bot deployed to Preview – threadplane June 20, 2026 03:23 View deployment

blove merged commit abc41a0 into main Jun 20, 2026
31 checks passed

blove mentioned this pull request Jun 20, 2026

fix(examples): drop redundant subagents sidebar tray in canonical demo (post-#711 dedup) #722

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(cockpit): chat/subagents renders inline subagent cards (real subgraph + aimock e2e)#718

feat(cockpit): chat/subagents renders inline subagent cards (real subgraph + aimock e2e)#718
blove merged 10 commits into
mainfrom
claude/cockpit-subagents-subgraph

blove commented Jun 20, 2026

Uh oh!

vercel Bot commented Jun 20, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jun 20, 2026 •

edited

Loading

Uh oh!

github-actions Bot left a comment

Uh oh!

github-actions Bot Jun 20, 2026

Uh oh!

github-actions Bot Jun 20, 2026

Uh oh!

github-actions Bot commented Jun 20, 2026 •

edited

Loading

Uh oh!

github-actions Bot Jun 20, 2026

Uh oh!

github-actions Bot Jun 20, 2026

Uh oh!

github-actions Bot commented Jun 20, 2026 •

edited

Loading

Uh oh!

github-actions Bot Jun 20, 2026

Uh oh!

github-actions Bot Jun 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

-    system_prompt = _SUBAGENT_PROMPTS.get(subagent_type, _ITINERARY_PROMPT)
+    system_prompt = _SUBAGENT_PROMPTS.get(subagent_type)
+    if system_prompt is None:
+        print(f"[_subagent_node] unknown subagent_type={subagent_type!r}; falling back to itinerary", flush=True)
+        system_prompt = _ITINERARY_PROMPT

	// continuation. Loose regex so refactors to the subagent prompts (research/
	await expect(page.locator('chat-subagent-card')).toHaveCount(3, { timeout: 15_000 });

Conversation

blove commented Jun 20, 2026

Summary

What changed

Why single-call subagents

Verification

Uh oh!

vercel Bot commented Jun 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Review

Summary

Correctness observations

Other notes

Verdict

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Jun 20, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Jun 20, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Jun 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Review

Summary

Verification of tracker behaviour

Open nits (from previous review, not yet addressed)

One new minor observation

Verdict

Uh oh!

github-actions Bot Jun 20, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Jun 20, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Jun 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Review

Summary

Two outstanding nits (flagged in prior two passes, still unaddressed)

Uh oh!

github-actions Bot Jun 20, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Jun 20, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

vercel Bot commented Jun 20, 2026 •

edited

Loading

github-actions Bot commented Jun 20, 2026 •

edited

Loading

github-actions Bot commented Jun 20, 2026 •

edited

Loading

github-actions Bot commented Jun 20, 2026 •

edited

Loading