
fix(gastown): rollback agent to idle on dispatch container start failure#2000

Closed
jrf0110 wants to merge 2 commits into convoy/fix-reconciler-p0-p1-bug-fixes-from-audi/f071bd6c/head from convoy/fix-reconciler-p0-p1-bug-fixes-from-audi/f071bd6c/gt/maple/62e5c90a

Conversation


@jrf0110 jrf0110 commented Apr 4, 2026

Summary

Previously, when dispatch_agent's async side effect failed (container start failure), the agent was left in working status with no container running, creating a 90-second dead zone until heartbeat timeout detection kicked in.

Now the .catch() handler in the dispatch_agent action rolls the agent back to idle via agentOps.updateAgentStatus() so the reconciler can retry dispatch on the next tick, per spec §5.4. The bead stays in_progress — no transition needed.

  • Updated comment to document the rollback behavior
  • Added "rolling back to idle" to the warning log message for observability

Verification

  • Typecheck passes
  • Reconciler-related tests pass (2 pre-existing failures in unrelated client.test.ts)
  • Manual review confirms the SQL write in .catch() is safe — it's part of the promise chain awaited by Promise.allSettled in Phase 2 of the alarm loop
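The safety claim in the last bullet can be checked in isolation. A minimal sketch (names like `dispatch` and `alarmPhase2` are illustrative, not the real gastown code): a rejection handler attached with `.catch()` is part of the promise chain, so `Promise.allSettled` does not settle until the rollback work inside the handler has completed.

```typescript
// Sketch: a failing dispatch whose .catch() rollback is awaited as part
// of the Promise.allSettled batch in "Phase 2".
async function dispatch(order: string[]): Promise<void> {
  order.push("dispatch");
  throw new Error("container start failed");
}

async function alarmPhase2(): Promise<string[]> {
  const order: string[] = [];
  const work = dispatch(order).catch(async () => {
    // Simulated async rollback write; allSettled awaits this as well,
    // because the handler's promise is the one in the batch.
    await new Promise((resolve) => setTimeout(resolve, 10));
    order.push("rollback");
  });
  await Promise.allSettled([work]);
  order.push("settled");
  return order;
}

alarmPhase2().then((order) => console.log(order.join(" -> ")));
// prints "dispatch -> rollback -> settled"
```

The key point is that the batch holds the post-`.catch()` promise, not the raw dispatch promise, so the rollback write cannot race past the end of the phase.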

Visual Changes

N/A

Reviewer Notes

  • The idle agent remains hooked to the in_progress bead. The reconciler handles this state: reconcileBeads will see an unassigned in_progress bead with an idle agent and re-dispatch.
  • There is an empty "WIP: container eviction save" commit (no file changes) — harmless artifact from the polecat's workflow.
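The re-dispatch condition in the first note can be sketched as a predicate. This is an assumed shape, not the real `reconcileBeads` implementation: an in_progress bead whose hooked agent is back in idle signals that an earlier dispatch failed and was rolled back.

```typescript
// Hypothetical types; the real reconciler's schema may differ.
type AgentStatus = "idle" | "working";
interface Agent { id: string; status: AgentStatus; }
interface Bead { id: string; status: "in_progress" | "done"; agentId: string | null; }

function needsRedispatch(bead: Bead, agents: Map<string, Agent>): boolean {
  if (bead.status !== "in_progress" || bead.agentId === null) return false;
  const agent = agents.get(bead.agentId);
  // An idle agent still hooked to an in_progress bead means the earlier
  // dispatch failed and was rolled back; dispatch again this tick.
  return agent !== undefined && agent.status === "idle";
}
```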

John Fawcett added 2 commits April 4, 2026 15:01
When dispatch_agent's async side effect fails (container start fails),
the agent was left in 'working' status with no container running,
creating a 90-second dead zone until heartbeat timeout detection.

Now the .catch() handler rolls the agent back to 'idle' so the
reconciler can retry dispatch on the next tick, per spec §5.4.
// Best-effort dispatch. If the container start fails, roll the
// agent back to 'idle' so the reconciler can retry on the next
// tick. The bead stays 'in_progress' — no transition needed.
await ctx.dispatchAgent(capturedAgentId, beadId, rigId).catch(err => {

WARNING: This only rolls back rejected dispatches

ctx.dispatchAgent() resolves to false on the normal startup failure paths in scheduling.dispatchAgent() (for example, when startAgentInContainer() returns false or the rig lookup fails). In those cases this .catch() never runs, so the agent still stays in working and the 90-second dead zone remains. Handle a falsy return value here as well if the goal is to roll the agent back to idle on failed container starts.
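One possible shape of the fix the warning asks for, as a hedged sketch (the interfaces and the `dispatchWithRollback` wrapper are assumptions, not the real actions.ts API): normalize both failure modes, rejection and a resolved `false`, then roll back in one place.

```typescript
// Hypothetical interfaces standing in for the real ctx/agentOps.
interface Ctx {
  dispatchAgent(agentId: string, beadId: string, rigId: string): Promise<boolean>;
}
interface AgentOps {
  updateAgentStatus(agentId: string, status: "idle" | "working"): Promise<void>;
}

async function dispatchWithRollback(
  ctx: Ctx,
  agentOps: AgentOps,
  agentId: string,
  beadId: string,
  rigId: string,
): Promise<boolean> {
  // Collapse both failure modes into `false`: a rejected promise and a
  // dispatch that resolved to false are treated the same way.
  const ok = await ctx.dispatchAgent(agentId, beadId, rigId).catch(() => false);
  if (!ok) {
    console.warn(`dispatch failed for agent ${agentId}; rolling back to idle`);
    await agentOps.updateAgentStatus(agentId, "idle");
  }
  return ok;
}
```

Folding the `.catch()` into a boolean keeps a single rollback path, so a later change to the rollback logic cannot diverge between the two failure modes.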


kilo-code-bot bot commented Apr 4, 2026

Code Review Summary

Status: 1 Issue Found | Recommendation: Address before merge

Overview

Severity Count
CRITICAL 0
WARNING 1
SUGGESTION 0


Issue Details

WARNING

File: cloudflare-gastown/src/dos/town/actions.ts (line 564)
Issue: Rollback only runs on rejected dispatches, so dispatchAgent() failures that resolve to false still leave the agent stuck in working.
Other Observations (not in diff)

N/A

Files Reviewed (1 files)
  • cloudflare-gastown/src/dos/town/actions.ts - 1 issue

Reviewed by gpt-5.4-20260305 · 284,225 tokens

@kilo-code-bot kilo-code-bot bot closed this Apr 5, 2026
@kilo-code-bot kilo-code-bot bot deleted the convoy/fix-reconciler-p0-p1-bug-fixes-from-audi/f071bd6c/gt/maple/62e5c90a branch April 5, 2026 00:30
