feat(auth): add first-run setup wizard by ndycode · Pull Request #103 · ndycode/codex-multi-auth

ndycode · 2026-03-16T08:47:52Z

Summary

Stacked PR on fresh/18-import-adapter-opencode
Reopens the preserved first-run wizard slice from archived PR #83
Current head: 2e22874

What Changed

Adds the first-run setup wizard before OAuth for brand-new installs, with restore, OpenCode import, settings, and doctor entry points.
Redacts first-run OpenCode import prompts and action-panel output to avoid leaking full filesystem paths in interactive flows.
Replaces the brittle import-stage string probe with explicit stage tracking so assessment failures stay labeled as assessment failures.
Adds direct Vitest coverage for the sync-history trim-skip threshold and the 0 vs null reset semantics used by the new append optimization.

Validation

npm run lint
npm run typecheck
npm run build
npm test
npm test -- test/codex-manager-cli.test.ts -t "imports OpenCode accounts from the first-run wizard|returns to the dashboard when OpenCode import fails|returns to the dashboard when OpenCode import assessment fails|redacts first-run backup listing warnings|falls back to a safe health summary when restore or rollback state reads fail"
npm test -- test/sync-history.test.ts -t "skips trim reads while append count stays within the default cap|reloads and trims once when append count crosses the default cap|re-reads seeded history after configureSyncHistoryForTests resets the estimate to null|preserves the latest entry per kind when appends exceed the default cap"

Docs and Governance Checklist

README updated (if user-visible behavior changed)
docs/getting-started.md updated (if onboarding flow changed)
docs/features.md updated (if capability surface changed)
relevant docs/reference/* pages updated (if commands/settings/paths changed)
docs/upgrade.md updated (if migration behavior changed)
SECURITY.md and CONTRIBUTING.md reviewed for alignment

Risk and Rollback

Risk level: medium
Rollback plan: revert ac2be34, 16ba251, 64a8ead, f5b74dc, and 2e22874

Additional Notes

This PR is intended to stay reviewable on top of fresh/18-import-adapter-opencode, not main.
The latest Greptile rerun completed successfully on 2e22874.
Remaining latest-head discussion is stale or non-blocking noise, not a concrete merge blocker.

note: greptile review for oc-chatgpt-multi-auth. cite files like `lib/foo.ts:123`. confirm regression tests + windows concurrency/token redaction coverage.

Greptile Summary

this pr lands the first-run setup wizard (restore, opencode import, settings, doctor entry points) on top of the fresh/18-import-adapter-opencode stack, and introduces a new sync-history.ts module with an append-optimized NDJSON history store. the wizard logic in codex-manager.ts is well-structured: assessment and import arms are in separate try/catch blocks, the doctor case is guarded, loadAccounts is wrapped after every wizard action, and path redaction is applied consistently throughout. the firstRunWizardShownInLoop flag correctly prevents double-firing.

windows safety gap in sync-history.ts: loadHistoryEntriesFromDisk is called unretried to seed historyEntryCountEstimate when it is null. all write paths use withRetryableHistoryWrite, but this read has no equivalent protection. on Windows, antivirus can return EBUSY on readFile, causing the entry to be silently lost before appendFile is ever reached. no vitest case covers this failure path.
dead-code deduplication in pruneSyncHistoryEntries: seenEntries compares object references, but every entry from loadHistoryEntriesFromDisk is a freshly allocated parse result. seenEntries.has(entry) is always false, so crash-duplicated JSON lines in the NDJSON file survive pruning unchanged.
redundant loadAccounts() inside wizard's assessOpencodeAccountPool call: both buildFirstRunWizardOptions and the import-opencode arm call assessOpencodeAccountPool() without passing currentStorage: null. the function then falls back to await loadAccounts() internally, which is a second unnecessary disk read in the first-run context (we already know storage is null). on Windows, this extra read is an additional EBUSY surface.
the sync-history.ts test file is solid: trim-skip threshold, reload-on-cap-cross, externally-cleared-file reset, and write-retry paths are all covered. the one gap is the EBUSY failure on the initial count-estimation readFile.

Confidence Score: 3/5

safe to merge after addressing the unretried count-estimation read in sync-history.ts; the wizard logic itself is sound and well-tested.
wizard logic in codex-manager.ts is correctly structured with proper error isolation, redaction, and the firstRunWizardShownInLoop guard. the main concern is lib/sync-history.ts: the count-estimation readFile has no retry wrapper, creating a real windows EBUSY data-loss path with no vitest coverage. the seenEntries dedup issue is minor but misleading. the redundant loadAccounts() in the wizard assessment calls is a smaller reliability concern. none of these block non-windows usage, but the unretried read is a genuine bug on windows.
lib/sync-history.ts — unretried count-estimation read and dead seenEntries dedup logic both need attention before merge on Windows targets.

Important Files Changed

Filename	Overview
lib/sync-history.ts	new file — introduces append-optimized NDJSON sync history with mutex serialization, retry logic for writes, and a trim-skip threshold. two issues: the initial count-estimation readFile is not retried (windows EBUSY risk on first append after a directory change), and the seenEntries deduplication in pruneSyncHistoryEntries uses reference equality so crash-duplicated lines survive pruning.
lib/codex-manager.ts	adds runFirstRunWizard and buildFirstRunWizardOptions with proper error isolation: assessment and import arms split into separate try/catch blocks, doctor case guarded, loadAccounts wrapped after each wizard action. minor: assessOpencodeAccountPool called without currentStorage: null in the wizard path, triggering a redundant loadAccounts() disk read.
lib/storage.ts	adds BackupRestoreAssessment, assessOpencodeAccountPool, assessNamedBackupRestoreCandidate, and redaction helpers. currentStorage defaulting to await loadAccounts() when option is undefined is correct but causes extra disk reads in first-run context. otherwise clean.
lib/ui/auth-menu.ts	adds FirstRunWizardOptions/FirstRunWizardAction types, showFirstRunWizard UI, and supporting formatting utilities. ANSI redaction patterns and menu rendering look correct. no issues found.
test/sync-history.test.ts	471-line new test file with good coverage: trim-skip threshold, reload-on-cap-cross, externally-cleared-file sentinel reset, EBUSY retry for append/trim writes, and configureSyncHistoryForTests null-reset semantics. missing: no test for EBUSY on the initial count-estimation readFile when historyEntryCountEstimate === null.
test/codex-manager-cli.test.ts	large test expansion covering first-run wizard flows, OpenCode import paths, assessment failures, backup-listing warning redaction, and health-summary restore/rollback failures. coverage is thorough for the happy paths and most failure paths through loginFlow and runFirstRunWizard.

Sequence Diagram

sequenceDiagram
    participant CLI as runAuthLogin
    participant W as runFirstRunWizard
    participant B as buildFirstRunWizardOptions
    participant S as storage.ts
    participant H as sync-history.ts

    CLI->>S: loadAccounts()
    S-->>CLI: null (first-run)
    CLI->>W: runFirstRunWizard(displaySettings)

    loop wizard loop
        W->>B: buildFirstRunWizardOptions()
        B->>S: listNamedBackups()
        B->>S: listRotatingBackups()
        B->>S: assessOpencodeAccountPool()<br/>[calls loadAccounts() internally — redundant]
        B-->>W: {namedBackupCount, hasOpencodeSource, …}
        W->>W: showFirstRunWizard(options)

        alt action = import-opencode
            W->>S: assessOpencodeAccountPool()<br/>[calls loadAccounts() internally — redundant]
            W->>S: importAccounts(path) [in runActionPanel]
        else action = restore
            W->>S: runBackupBrowserManager [unguarded]
        else action = doctor
            W->>W: runActionPanel → runDoctor [guarded try/catch]
        else action = settings
            W->>W: configureUnifiedSettings [unguarded]
        else action = login / skip
            W-->>CLI: {outcome: continue}
        else action = cancel
            W-->>CLI: {outcome: cancelled}
        end

        W->>S: loadAccounts() [guarded try/catch]
        note over W,S: returns if accounts.length > 0
    end

    CLI->>H: appendSyncHistoryEntry(entry)
    note over H: if historyEntryCountEstimate===null<br/>→ loadHistoryEntriesFromDisk()<br/>[NO retry — EBUSY risk on Windows]
    H->>H: withRetryableHistoryWrite → appendFile [retried]
    H->>H: trimHistoryFileIfNeeded [if estimate > 200]

Prompt To Fix All With AI

This is a comment left during a code review.
Path: lib/sync-history.ts
Line: 321-322

Comment:
**unretried count-estimation read drops entries on Windows EBUSY**

the `loadHistoryEntriesFromDisk` call for count estimation is not wrapped in any retry logic. `withRetryableHistoryWrite` guards all the write paths (appendFile, writeFile, rm), but the preceding read that seeds `historyEntryCountEstimate` when it is `null` (e.g. right after `configureSyncHistoryForTests`) has no equivalent protection. on Windows, antivirus/backup software can return `EBUSY` or `EPERM` on a `readFile` call to a newly active file, which causes:

1. `historyEntryCountEstimate` stays `null`
2. the exception propagates out of the lock before `appendFile` is ever reached
3. the calling entry is silently not written to disk
4. `historyEntryCountEstimate` remains `null` on the next call, which re-tries the read – the previously missed entry is gone for good

`withRetryableHistoryWrite` works for writes; a `withRetryableHistoryRead` equivalent (or at minimum a small inline retry loop with the same `RETRYABLE_WRITE_CODES` set) should wrap this call. there is no vitest case that mocks `fs.readFile` to throw `EBUSY` during the initial estimation phase and then asserts the entry is still appended.

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: lib/sync-history.ts
Line: 224-241

Comment:
**`seenEntries` reference-equality check is effectively dead code**

`seenEntries` is built with `new Set<SyncHistoryEntry>()` and used via `seenEntries.has(entry)`. every entry in the `entries` array comes from `parseHistoryContent` → `parseEntry(line)` → `cloneEntry(parsed)`, so each is a freshly allocated object with a unique reference. `seenEntries.has(entry)` therefore never returns `true` during the loop iteration, and `seenEntries` never prevents any entry from being re-added.

the practical consequence is that if a crash-retry path appended the same JSON line twice to the NDJSON file, `pruneSyncHistoryEntries` would not deduplicate them – both copies survive the prune. if content-based deduplication is the goal, the set should key on a serialized form (e.g. `JSON.stringify(entry)`). if the only intent is to guard against the same *object reference* appearing twice in the array (impossible with parsed results), the `seenEntries` machinery can be removed entirely to avoid misleading future readers.

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: lib/codex-manager.ts
Line: 4373-4374

Comment:
**redundant `loadAccounts()` inside `assessOpencodeAccountPool` during first-run wizard**

`assessOpencodeAccountPool()` is called here without a `currentStorage` option. inside the function, when `options.currentStorage` is `undefined`, it defaults to `await loadAccounts()` (storage.ts lines 2491–2494). in the first-run wizard context, `storage` is already known to be `null` (the wizard is only shown when `shouldShowFirstRunWizard` returns `true`, which requires `storage === null`). the extra `loadAccounts()` disk read inside the assessment is therefore redundant.

on Windows, antivirus can return `EBUSY` on this read, causing the assessment to throw – which is caught at line 4375–4381 and shows a warning, but the user misses the import flow. passing `currentStorage: null` explicitly avoids both the unnecessary disk read and the associated reliability risk:

```suggestion
			assessment = await assessOpencodeAccountPool({ currentStorage: null });
```

the same applies to the `buildFirstRunWizardOptions` call at line 4336 (`hasOpencodeSource = (await assessOpencodeAccountPool()) !== null`) – pass `{ currentStorage: null }` there too to avoid a second implicit `loadAccounts()` during option building.

How can I resolve this? If you propose a fix, please make it concise.

_{Last reviewed commit: "fix(sync): retry syn..."}

Greptile also left 2 inline comments on this PR.

chatgpt-codex-connector · 2026-03-16T08:47:58Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.
Credits must be used to enable repository wide code reviews.

lib/codex-manager.ts

…' into work/pr103-remediate-20260318 # Conflicts: # docs/getting-started.md # lib/codex-manager.ts # lib/ui/copy.ts # test/codex-manager-cli.test.ts # test/storage.test.ts

lib/codex-manager.ts

coderabbitai

Actionable comments posted: 3

♻️ Duplicate comments (3)

lib/codex-manager.ts (2)
4645-4668: ⚠️ Potential issue | 🟠 Major

confirm() can still abort this import flow.

lib/codex-manager.ts:4645-4650 sits outside both import error handlers. if confirm() throws on a tty/readline failure, this path skips the new redacted handling and exits codex auth login instead of returning to the menu like the assessment and import stages.
suggested change
-				const confirmed = await confirm(
-					`Import OpenCode accounts from ${backupLabel}?`,
-				);
-				if (!confirmed) {
-					continue;
-				}
-				try {
+				let importStage: "confirm" | "import" = "confirm";
+				try {
+					const confirmed = await confirm(
+						`Import OpenCode accounts from ${backupLabel}?`,
+					);
+					if (!confirmed) {
+						continue;
+					}
+					importStage = "import";
 					await runActionPanel(
 						"Import OpenCode Accounts",
 						`Importing from ${backupLabel}`,
@@
-				} catch (error) {
+				} catch (error) {
 					const errorLabel = collapseWhitespace(
 						formatRedactedFilesystemError(error),
 					);
-					console.error(`Import failed: ${errorLabel}`);
+					console.error(
+						`${importStage === "confirm" ? "Import confirmation failed" : "Import failed"}: ${errorLabel}`,
+					);
 				}
please add the matching regression in test/codex-manager-cli.test.ts:1 by forcing confirm() to throw and asserting the menu resumes. As per coding guidelines, lib/**: focus on auth rotation, windows filesystem IO, and concurrency. verify every change cites affected tests (vitest) and that new queues handle EBUSY/429 scenarios.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@lib/codex-manager.ts` around lines 4645 - 4668, The confirm() call before
importing accounts can throw (e.g., tty/readline failure) and currently sits
outside the import error handling so a thrown error will escape and exit the
CLI; wrap the confirm() invocation in the same error-handling flow (or add a
surrounding try/catch) so thrown errors are processed via
formatRedactedFilesystemError()/collapseWhitespace and the command returns to
the menu instead of exiting; update the block around confirm(),
runActionPanel(), and importAccounts() (symbols: confirm, runActionPanel,
importAccounts, formatRedactedFilesystemError, collapseWhitespace) to ensure
confirm errors are caught and handled the same way as import errors, and add a
vitest regression in test/codex-manager-cli.test.ts that forces confirm() to
throw and asserts the menu resumes.
4371-4409: ⚠️ Potential issue | 🟠 Major

add the missing vitest for wizard-side opencode probe throws.

lib/storage.ts:2470-2495 shows assessOpencodeAccountPool() can throw, and lib/codex-manager.ts:4406-4409 is now the guard that keeps first-run codex auth login alive when that happens. the pr summary still calls this gap out, and i do not see the matching regression in test/codex-manager-cli.test.ts:1. please add a case that forces the assess step to throw, asserts the warning stays redacted, and verifies the wizard keeps looping instead of exiting. As per coding guidelines, lib/**: focus on auth rotation, windows filesystem IO, and concurrency. verify every change cites affected tests (vitest) and that new queues handle EBUSY/429 scenarios.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@lib/codex-manager.ts` around lines 4371 - 4409, Add a vitest that simulates
assessOpencodeAccountPool throwing and verifies codex-manager handles it without
exiting: stub/mock assessOpencodeAccountPool to throw a filesystem error, stub
getRedactedFilesystemErrorLabel to return a predictable redacted string, invoke
the CLI path exercised by "import-opencode" (same flow exercised by
test/codex-manager-cli.test.ts), assert console.warn was called with the
redacted label returned by getRedactedFilesystemErrorLabel, and assert the
wizard/CLI loop did not exit (e.g. the command returned to prompt or continued
to the next loop iteration); reference functions assessOpencodeAccountPool,
getRedactedFilesystemErrorLabel, runActionPanel, and importAccounts when
locating code to stub and update the test to cover this regression.
test/codex-manager-cli.test.ts (1)
1201-1209: ⚠️ Potential issue | 🟠 Major

these redaction assertions still only check warn arg 0.

test/codex-manager-cli.test.ts:1201-1209 and test/codex-manager-cli.test.ts:1239-1240 only inspect call[0]. a raw windows path can still leak via call[1+] and the tests stay green.
proposed fix
-		expect(
-			warnSpy.mock.calls.some(([message]) =>
-				String(message).includes(
-					"Failed to refresh saved accounts after first-run action",
-				),
-			),
-		).toBe(true);
-		expect(
-			warnSpy.mock.calls.every((call) => !String(call[0]).includes("alice")),
-		).toBe(true);
+		const warningLines = warnSpy.mock.calls.map(flattenMockCallArgs);
+		expect(
+			warningLines.some((line) =>
+				line.includes("Failed to refresh saved accounts after first-run action"),
+			),
+		).toBe(true);
+		expect(warningLines.every((line) => !line.includes("alice"))).toBe(true);
@@
-		expect(
-			warnSpy.mock.calls.every((call) => !String(call[0]).includes("alice")),
-		).toBe(true);
+		const warningLines = warnSpy.mock.calls.map(flattenMockCallArgs);
+		expect(warningLines.every((line) => !line.includes("alice"))).toBe(true);
as per coding guidelines, test/**: tests must stay deterministic and use vitest. demand regression cases that reproduce concurrency bugs, token refresh races, and windows filesystem behavior. reject changes that mock real secrets or skip assertions.

Also applies to: 1235-1240
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@test/codex-manager-cli.test.ts` around lines 1201 - 1209, The failing tests
only inspect warnSpy.mock.calls[i][0], so sensitive data in later args can still
leak; update the two assertions that currently check only call[0] (the
expect(...includes("Failed to refresh saved accounts...")) and the
expect(...every(call => !String(call[0]).includes("alice")))) to scan all
arguments of each call (e.g., join or iterate over call.slice(0) / call) and
assert that none of the arguments contain the sensitive substrings ("Failed to
refresh saved accounts after first-run action", "alice") — apply the same change
to the duplicate assertion block around the second instance (the other expect at
1235-1240) and ensure warnSpy.mock.calls is fully inspected for every call
rather than only index 0.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@lib/sync-history.ts`:
- Around line 290-308: The cached historyEntryCountEstimate is unsafe under
multi-writer conditions; after the initial load (loadHistoryEntriesFromDisk) and
after appending (fs.appendFile to paths.historyPath), revalidate the on-disk
state instead of trusting the in-memory estimate: check the file mtime/size or
re-read the file (parse lines via serializeEntry/deserialize) and compute the
current entry count, then call trimHistoryFileIfNeeded(paths) based on that
authoritative count; update historyEntryCountEstimate from the re-read value.
Also add a vitest that seeds the file, initializes historyEntryCountEstimate,
then simulates an out-of-band append and verifies a subsequent append triggers
revalidation and trimming. Ensure any new retry logic for trimming/appending
handles transient EBUSY/429 errors with exponential backoff and cite the
affected tests in the commit message.

In `@test/codex-manager-cli.test.ts`:
- Around line 1215-1220: The test currently stubs showFirstRunWizard which
prevents exercising the real wizard re-render path when
assessOpencodeAccountPool throws; add an additional deterministic Vitest case
that does NOT spy on auth-menu.showFirstRunWizard and instead drives the real UI
flow by using selectMock to simulate user choices for "import-opencode" followed
by "cancel", while keeping loadAccountsMock as before and making
assessOpencodeAccountPool throw to trigger the re-render; ensure the new test
unregisters any spies, uses the real showFirstRunWizard function, asserts the
re-render occurred (e.g., that showFirstRunWizard was invoked twice or that
expected UI prompts were emitted), and keeps the test deterministic with Vitest
utilities.
- Around line 68-74: flattenMockCallArgs currently uses String(arg) which turns
objects into "[object Object]" and can miss leaked file paths; update
flattenMockCallArgs to robustly serialize arguments by handling Error instances
as `${err.name}: ${err.message}`, serializing objects with a JSON.stringify
replacer that redacts Windows-style absolute paths (e.g. matching
/[A-Za-z]:\\[^"\s]*/ → "[REDACTED_PATH]") and guards against circular references
using a WeakSet, then join the serialized pieces with spaces so object fields
containing paths are detected and redacted during tests.

---

Duplicate comments:
In `@lib/codex-manager.ts`:
- Around line 4645-4668: The confirm() call before importing accounts can throw
(e.g., tty/readline failure) and currently sits outside the import error
handling so a thrown error will escape and exit the CLI; wrap the confirm()
invocation in the same error-handling flow (or add a surrounding try/catch) so
thrown errors are processed via
formatRedactedFilesystemError()/collapseWhitespace and the command returns to
the menu instead of exiting; update the block around confirm(),
runActionPanel(), and importAccounts() (symbols: confirm, runActionPanel,
importAccounts, formatRedactedFilesystemError, collapseWhitespace) to ensure
confirm errors are caught and handled the same way as import errors, and add a
vitest regression in test/codex-manager-cli.test.ts that forces confirm() to
throw and asserts the menu resumes.
- Around line 4371-4409: Add a vitest that simulates assessOpencodeAccountPool
throwing and verifies codex-manager handles it without exiting: stub/mock
assessOpencodeAccountPool to throw a filesystem error, stub
getRedactedFilesystemErrorLabel to return a predictable redacted string, invoke
the CLI path exercised by "import-opencode" (same flow exercised by
test/codex-manager-cli.test.ts), assert console.warn was called with the
redacted label returned by getRedactedFilesystemErrorLabel, and assert the
wizard/CLI loop did not exit (e.g. the command returned to prompt or continued
to the next loop iteration); reference functions assessOpencodeAccountPool,
getRedactedFilesystemErrorLabel, runActionPanel, and importAccounts when
locating code to stub and update the test to cover this regression.

In `@test/codex-manager-cli.test.ts`:
- Around line 1201-1209: The failing tests only inspect
warnSpy.mock.calls[i][0], so sensitive data in later args can still leak; update
the two assertions that currently check only call[0] (the
expect(...includes("Failed to refresh saved accounts...")) and the
expect(...every(call => !String(call[0]).includes("alice")))) to scan all
arguments of each call (e.g., join or iterate over call.slice(0) / call) and
assert that none of the arguments contain the sensitive substrings ("Failed to
refresh saved accounts after first-run action", "alice") — apply the same change
to the duplicate assertion block around the second instance (the other expect at
1235-1240) and ensure warnSpy.mock.calls is fully inspected for every call
rather than only index 0.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: a04cfa8b-f5bc-4b9a-b891-12f06d6d6b28

📥 Commits

Reviewing files that changed from the base of the PR and between e332b9c and ebb527f.

📒 Files selected for processing (6)

docs/getting-started.md
lib/codex-manager.ts
lib/sync-history.ts
test/codex-manager-cli.test.ts
test/storage.test.ts
test/sync-history.test.ts

💤 Files with no reviewable changes (1)

test/storage.test.ts

📜 Review details

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: Greptile Review

🧰 Additional context used

📓 Path-based instructions (3)

lib/**

⚙️ CodeRabbit configuration file

focus on auth rotation, windows filesystem IO, and concurrency. verify every change cites affected tests (vitest) and that new queues handle EBUSY/429 scenarios. check for logging that leaks tokens or emails.

Files:

lib/sync-history.ts
lib/codex-manager.ts

test/**

⚙️ CodeRabbit configuration file

tests must stay deterministic and use vitest. demand regression cases that reproduce concurrency bugs, token refresh races, and windows filesystem behavior. reject changes that mock real secrets or skip assertions.

Files:

test/sync-history.test.ts
test/codex-manager-cli.test.ts

docs/**

⚙️ CodeRabbit configuration file

keep README, SECURITY, and docs consistent with actual CLI flags and workflows. whenever behavior changes, require updated upgrade notes and mention new npm scripts.

Files:

docs/getting-started.md

lib/sync-history.ts

coderabbitai · 2026-03-18T02:25:06Z

test/codex-manager-cli.test.ts

+function flattenMockCallArgs(call: unknown[]): string {
+	return call
+		.map((arg) =>
+			arg instanceof Error ? `${arg.name}: ${arg.message}` : String(arg),
+		)
+		.join(" ");
+}


⚠️ Potential issue | 🟠 Major

redaction helper can hide leaked paths in object args.

in test/codex-manager-cli.test.ts:68-73, String(arg) turns objects into "[object Object]". if a logger call includes { error: "C:\\Users\\alice\\..." }, this helper will miss the leak.

proposed fix

+function serializeMockArg(arg: unknown): string { + if (arg instanceof Error) { + return `${arg.name}: ${arg.message}`; + } + if (typeof arg === "string") { + return arg; + } + try { + return JSON.stringify(arg); + } catch { + return String(arg); + } +} + function flattenMockCallArgs(call: unknown[]): string { return call - .map((arg) => - arg instanceof Error ? `${arg.name}: ${arg.message}` : String(arg), - ) + .map((arg) => serializeMockArg(arg)) .join(" "); }

as per coding guidelines, test/**: tests must stay deterministic and use vitest. demand regression cases that reproduce concurrency bugs, token refresh races, and windows filesystem behavior. reject changes that mock real secrets or skip assertions.

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

function flattenMockCallArgs(call: unknown[]): string {

return call

.map((arg) =>

arg instanceof Error ? `${arg.name}: ${arg.message}` : String(arg),

)

.join(" ");

}

function serializeMockArg(arg: unknown): string {

if (arg instanceof Error) {

return `${arg.name}: ${arg.message}`;

}

if (typeof arg === "string") {

return arg;

}

try {

return JSON.stringify(arg);

} catch {

return String(arg);

}

}

function flattenMockCallArgs(call: unknown[]): string {

return call

.map((arg) => serializeMockArg(arg))

.join(" ");

}

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@test/codex-manager-cli.test.ts` around lines 68 - 74, flattenMockCallArgs currently uses String(arg) which turns objects into "[object Object]" and can miss leaked file paths; update flattenMockCallArgs to robustly serialize arguments by handling Error instances as `${err.name}: ${err.message}`, serializing objects with a JSON.stringify replacer that redacts Windows-style absolute paths (e.g. matching /[A-Za-z]:\\[^"\s]*/ → "[REDACTED_PATH]") and guards against circular references using a WeakSet, then join the serialized pieces with spaces so object fields containing paths are detected and redacted during tests.

coderabbitai · 2026-03-18T02:25:06Z

test/codex-manager-cli.test.ts

+		const authMenu = await import("../lib/ui/auth-menu.js");
+		const wizardSpy = vi
+			.spyOn(authMenu, "showFirstRunWizard")
+			.mockResolvedValueOnce({ type: "import-opencode" })
+			.mockResolvedValueOnce({ type: "cancel" });
+		loadAccountsMock.mockResolvedValue(null);


⚠️ Potential issue | 🟡 Minor

coverage gap: this test mocks away the real wizard-loop failure path.

in test/codex-manager-cli.test.ts:1215-1220, stubbing showFirstRunWizard means we do not exercise the actual first-run menu re-render when assessOpencodeAccountPool throws. add one deterministic case that uses the real wizard UI path (no spy on showFirstRunWizard) and drives import-opencode -> cancel via selectMock.

as per coding guidelines, test/**: tests must stay deterministic and use vitest. demand regression cases that reproduce concurrency bugs, token refresh races, and windows filesystem behavior. reject changes that mock real secrets or skip assertions.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@test/codex-manager-cli.test.ts` around lines 1215 - 1220, The test currently stubs showFirstRunWizard which prevents exercising the real wizard re-render path when assessOpencodeAccountPool throws; add an additional deterministic Vitest case that does NOT spy on auth-menu.showFirstRunWizard and instead drives the real UI flow by using selectMock to simulate user choices for "import-opencode" followed by "cancel", while keeping loadAccountsMock as before and making assessOpencodeAccountPool throw to trigger the re-render; ensure the new test unregisters any spies, uses the real showFirstRunWizard function, asserts the re-render occurred (e.g., that showFirstRunWizard was invoked twice or that expected UI prompts were emitted), and keeps the test deterministic with Vitest utilities.

lib/sync-history.ts

lib/codex-manager.ts

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (6)

lib/sync-history.ts (1)
290-308: ⚠️ Potential issue | 🟠 Major

cached entry count is still unsafe with out-of-band writers.

lib/sync-history.ts:290-308 trusts historyEntryCountEstimate after first init and does not revalidate disk state before deciding shouldTrim. if another process appends/prunes sync-history.ndjson, this process can skip trimming for too long and let history grow past policy. please revalidate on-disk state (mtime/size or bounded reread) before trim gating, and add a deterministic vitest that simulates out-of-band append after estimate warm-up.

As per coding guidelines, lib/**: focus on auth rotation, windows filesystem IO, and concurrency. verify every change cites affected tests (vitest) and that new queues handle EBUSY/429 scenarios. check for logging that leaks tokens or emails.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@lib/sync-history.ts` around lines 290 - 308, The code uses
historyEntryCountEstimate (warmed via loadHistoryEntriesFromDisk) and then
decides shouldTrim without revalidating on-disk state; change the logic around
the append+trim decision (the block that writes serializeEntry(entry) to
paths.historyPath and computes shouldTrim) to revalidate the file before gating
trim by either checking the file mtime/size or performing a bounded reread of
the file tail to update historyEntryCountEstimate (or compute a safe upper
bound) and only then call trimHistoryFileIfNeeded(paths); also ensure
trimHistoryFileIfNeeded still works correctly when called concurrently and keep
loadHistoryEntriesFromDisk and serializeEntry usage intact. Add a vitest that
warms the estimate, performs an out-of-band append to paths.historyPath
(simulating another process), then runs the code path that triggers trimming to
confirm trimming executes deterministically; include the new test alongside
existing sync-history tests and handle EBUSY-like file contention in the test
setup without leaking tokens/emails in logs.
lib/codex-manager.ts (2)
4850-4878: 🧹 Nitpick | 🔵 Trivial

remove the unreachable first-run replay branch.

lib/codex-manager.ts:4477 (Line 4477) sets firstRunWizardShownInLoop = true on the only startedFromMissingStorage path, so lib/codex-manager.ts:4850 (Line 4850) guard cannot pass. this is dead control flow and makes startup logic harder to reason about.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@lib/codex-manager.ts` around lines 4850 - 4878, The if-block guarded by
startedFromMissingStorage && !firstRunWizardShownInLoop && existingCount === 0
&& isInteractiveLoginMenuAvailable() (the block that calls
loadDashboardDisplaySettings, runFirstRunWizard, and attempts
refreshedAfterWizard) is unreachable because firstRunWizardShownInLoop is
already set true on the only startedFromMissingStorage path; remove this entire
unreachable branch and its inner logic, and also remove or simplify any
now-unused references to firstRunWizardShownInLoop (or related cleanup code) so
startup flow no longer contains dead control flow around runFirstRunWizard and
refreshedAfterWizard handling.
4316-4341: ⚠️ Potential issue | 🟠 Major

avoid double opencode probes and add transient fs retry in wizard path.

lib/codex-manager.ts:4336 (Line 4336) probes with assessOpencodeAccountPool() during menu option build, then lib/codex-manager.ts:4374 (Line 4374) probes again when import-opencode is selected. this doubles windows fs contention and still has no bounded retry/backoff for transient EBUSY/sharing-lock failures. it can also hide the import action when the pre-probe throws and hasOpencodeSource stays false.
proposed direction
+async function withTransientFsRetry<T>(op: () => Promise<T>, attempts = 3): Promise<T> {
+  let delayMs = 75;
+  for (let i = 0; i < attempts; i += 1) {
+    try {
+      return await op();
+    } catch (error) {
+      const code = (error as { code?: string })?.code;
+      const transient = code === "EBUSY" || code === "EPERM" || code === "EMFILE";
+      if (!transient || i === attempts - 1) throw error;
+      await new Promise((resolve) => setTimeout(resolve, delayMs));
+      delayMs *= 2;
+    }
+  }
+  throw new Error("unreachable");
+}
- hasOpencodeSource = (await assessOpencodeAccountPool()) !== null;
+ hasOpencodeSource = (await withTransientFsRetry(() => assessOpencodeAccountPool())) !== null;
- assessment = await assessOpencodeAccountPool();
+ assessment = await withTransientFsRetry(() => assessOpencodeAccountPool());
As per coding guidelines, "lib/**: focus on auth rotation, windows filesystem IO, and concurrency. verify every change cites affected tests (vitest) and that new queues handle EBUSY/429 scenarios. check for logging that leaks tokens or emails."

Also applies to: 4371-4418
test/codex-manager-cli.test.ts (3)
1241-1249: ⚠️ Potential issue | 🟠 Major

assert redaction against the full logger payload, not only call[0].

at test/codex-manager-cli.test.ts:1241, test/codex-manager-cli.test.ts:1279, test/codex-manager-cli.test.ts:1338, and test/codex-manager-cli.test.ts:4246, checks still miss secondary args and object args. a console.warn("...", err) call can leak full windows paths while these tests pass.
proposed fix
- expect(
-   warnSpy.mock.calls.every((call) => !String(call[0]).includes("alice")),
- ).toBe(true);
+ expect(
+   warnSpy.mock.calls.every(
+     (call) => !flattenMockCallArgs(call).includes("alice"),
+   ),
+ ).toBe(true);

- const warningOutput = warnSpy.mock.calls.flat().join("\n");
+ const warningOutput = warnSpy.mock.calls
+   .map((call) => flattenMockCallArgs(call))
+   .join("\n");
as per coding guidelines, test/**: tests must stay deterministic and use vitest. demand regression cases that reproduce concurrency bugs, token refresh races, and windows filesystem behavior. reject changes that mock real secrets or skip assertions.

Also applies to: 1279-1280, 1338-1343, 4246-4255
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@test/codex-manager-cli.test.ts` around lines 1241 - 1249, The assertions
currently inspect only the first argument of warnSpy.mock.calls (call[0]) which
misses secondary args or object args that may leak sensitive data; update the
checks (those using warnSpy.mock.calls.some(...) and .every(...)) to inspect the
entire call payload (e.g., stringify or join all call elements such as using
JSON.stringify(call) or call.map(String).join(" ")) and then assert the
presence/absence of the target text (e.g., "Failed to refresh saved accounts
after first-run action" and that "alice" is not present) against that full
payload; apply the same change to the other occurrences referenced in this test
file so each assertion covers all arguments in warnSpy.mock.calls.
68-74: ⚠️ Potential issue | 🟠 Major

serialize mock args deeply for redaction checks.

String(arg) hides structured payloads as "[object Object]", so path leaks in object fields can bypass checks in test/codex-manager-cli.test.ts:68. this weakens windows redaction coverage.
proposed fix
+function serializeMockArg(arg: unknown): string {
+	if (arg instanceof Error) return `${arg.name}: ${arg.message}`;
+	if (typeof arg === "string") return arg;
+	const seen = new WeakSet<object>();
+	const redactWindowsPath = (value: string) =>
+		value.replace(/[A-Za-z]:\\[^"'\s)]+/g, "[REDACTED_PATH]");
+	try {
+		return JSON.stringify(arg, (_key, value) => {
+			if (typeof value === "string") return redactWindowsPath(value);
+			if (value && typeof value === "object") {
+				if (seen.has(value as object)) return "[Circular]";
+				seen.add(value as object);
+			}
+			return value;
+		});
+	} catch {
+		return String(arg);
+	}
+}
+
 function flattenMockCallArgs(call: unknown[]): string {
 	return call
-		.map((arg) =>
-			arg instanceof Error ? `${arg.name}: ${arg.message}` : String(arg),
-		)
+		.map((arg) => serializeMockArg(arg))
 		.join(" ");
 }
as per coding guidelines, test/**: tests must stay deterministic and use vitest. demand regression cases that reproduce concurrency bugs, token refresh races, and windows filesystem behavior. reject changes that mock real secrets or skip assertions.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@test/codex-manager-cli.test.ts` around lines 68 - 74, The helper
flattenMockCallArgs currently uses String(arg) which collapses objects to
"[object Object]" and hides fields; update flattenMockCallArgs to serialize
arguments deeply so object fields are visible for redaction checks: in the
flattenMockCallArgs function, detect Error instances (keep `${err.name}:
${err.message}`) and for other values use a deep serializer (e.g.,
JSON.stringify with a circular-safe replacer or Node's util.inspect) to produce
full object payloads (falling back to String(arg) if serialization fails) so
tests in test/codex-manager-cli.test.ts can assert on nested paths instead of
being bypassed by "[object Object]".
1253-1260: ⚠️ Potential issue | 🟡 Minor

do not mock away the wizard re-render path in this probe-failure regression.

test/codex-manager-cli.test.ts:1253 stubs showFirstRunWizard, so it does not exercise the real first-run render/re-render flow when OpenCode probing fails. that leaves the windows EBUSY wizard-loop behavior under-tested.

as per coding guidelines, test/**: tests must stay deterministic and use vitest. demand regression cases that reproduce concurrency bugs, token refresh races, and windows filesystem behavior. reject changes that mock real secrets or skip assertions.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@test/codex-manager-cli.test.ts` around lines 1253 - 1260, The test should not
stub out the real first-run wizard flow: remove the mockResolvedValueOnce stubs
on authMenu.showFirstRunWizard so the actual showFirstRunWizard implementation
runs (or replace the stub with a call-through). Concretely, in the test remove
the .mockResolvedValueOnce(...) calls on the spy for showFirstRunWizard (or
replace them with a call-through wrapper that invokes the original function:
capture const orig = authMenu.showFirstRunWizard and use vi.spyOn(authMenu,
"showFirstRunWizard").mockImplementation((...a) => orig.apply(authMenu, a))).
This ensures the real render/re-render path and EBUSY windows behavior are
exercised while still letting you observe the call.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@lib/sync-history.ts`:
- Around line 293-309: The production write paths can fail on transient Windows
file-lock errors; wrap the append/rewrite/trim write operations in bounded
retry+exponential backoff (same behavior as __resetSyncHistoryForTests): up to 5
attempts with delay 25 * 2^attempt ms, and only retry for transient error codes
(EBUSY/EPERM). Apply this retry logic around the fs.appendFile call in
appendSyncHistoryEntry, around the file-writes inside trimHistoryFileIfNeeded,
and around rewriteLatestEntry so each physical write is retried before failing.
Add a Vitest that stubs/mock fs to throw an EBUSY/EPERM on the first N-1
attempts and succeed on the final attempt to verify recovery.

---

Duplicate comments:
In `@lib/codex-manager.ts`:
- Around line 4850-4878: The if-block guarded by startedFromMissingStorage &&
!firstRunWizardShownInLoop && existingCount === 0 &&
isInteractiveLoginMenuAvailable() (the block that calls
loadDashboardDisplaySettings, runFirstRunWizard, and attempts
refreshedAfterWizard) is unreachable because firstRunWizardShownInLoop is
already set true on the only startedFromMissingStorage path; remove this entire
unreachable branch and its inner logic, and also remove or simplify any
now-unused references to firstRunWizardShownInLoop (or related cleanup code) so
startup flow no longer contains dead control flow around runFirstRunWizard and
refreshedAfterWizard handling.

In `@lib/sync-history.ts`:
- Around line 290-308: The code uses historyEntryCountEstimate (warmed via
loadHistoryEntriesFromDisk) and then decides shouldTrim without revalidating
on-disk state; change the logic around the append+trim decision (the block that
writes serializeEntry(entry) to paths.historyPath and computes shouldTrim) to
revalidate the file before gating trim by either checking the file mtime/size or
performing a bounded reread of the file tail to update historyEntryCountEstimate
(or compute a safe upper bound) and only then call
trimHistoryFileIfNeeded(paths); also ensure trimHistoryFileIfNeeded still works
correctly when called concurrently and keep loadHistoryEntriesFromDisk and
serializeEntry usage intact. Add a vitest that warms the estimate, performs an
out-of-band append to paths.historyPath (simulating another process), then runs
the code path that triggers trimming to confirm trimming executes
deterministically; include the new test alongside existing sync-history tests
and handle EBUSY-like file contention in the test setup without leaking
tokens/emails in logs.

In `@test/codex-manager-cli.test.ts`:
- Around line 1241-1249: The assertions currently inspect only the first
argument of warnSpy.mock.calls (call[0]) which misses secondary args or object
args that may leak sensitive data; update the checks (those using
warnSpy.mock.calls.some(...) and .every(...)) to inspect the entire call payload
(e.g., stringify or join all call elements such as using JSON.stringify(call) or
call.map(String).join(" ")) and then assert the presence/absence of the target
text (e.g., "Failed to refresh saved accounts after first-run action" and that
"alice" is not present) against that full payload; apply the same change to the
other occurrences referenced in this test file so each assertion covers all
arguments in warnSpy.mock.calls.
- Around line 68-74: The helper flattenMockCallArgs currently uses String(arg)
which collapses objects to "[object Object]" and hides fields; update
flattenMockCallArgs to serialize arguments deeply so object fields are visible
for redaction checks: in the flattenMockCallArgs function, detect Error
instances (keep `${err.name}: ${err.message}`) and for other values use a deep
serializer (e.g., JSON.stringify with a circular-safe replacer or Node's
util.inspect) to produce full object payloads (falling back to String(arg) if
serialization fails) so tests in test/codex-manager-cli.test.ts can assert on
nested paths instead of being bypassed by "[object Object]".
- Around line 1253-1260: The test should not stub out the real first-run wizard
flow: remove the mockResolvedValueOnce stubs on authMenu.showFirstRunWizard so
the actual showFirstRunWizard implementation runs (or replace the stub with a
call-through). Concretely, in the test remove the .mockResolvedValueOnce(...)
calls on the spy for showFirstRunWizard (or replace them with a call-through
wrapper that invokes the original function: capture const orig =
authMenu.showFirstRunWizard and use vi.spyOn(authMenu,
"showFirstRunWizard").mockImplementation((...a) => orig.apply(authMenu, a))).
This ensures the real render/re-render path and EBUSY windows behavior are
exercised while still letting you observe the call.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: f5845167-2d5c-4391-8861-15a986edc805

📥 Commits

Reviewing files that changed from the base of the PR and between ebb527f and 42a9197.

📒 Files selected for processing (4)

lib/codex-manager.ts
lib/sync-history.ts
test/codex-manager-cli.test.ts
test/sync-history.test.ts

📜 Review details

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: Greptile Review

🧰 Additional context used

📓 Path-based instructions (2)

test/**

⚙️ CodeRabbit configuration file

tests must stay deterministic and use vitest. demand regression cases that reproduce concurrency bugs, token refresh races, and windows filesystem behavior. reject changes that mock real secrets or skip assertions.

Files:

test/sync-history.test.ts
test/codex-manager-cli.test.ts

lib/**

⚙️ CodeRabbit configuration file

focus on auth rotation, windows filesystem IO, and concurrency. verify every change cites affected tests (vitest) and that new queues handle EBUSY/429 scenarios. check for logging that leaks tokens or emails.

Files:

lib/codex-manager.ts
lib/sync-history.ts

🔇 Additional comments (3)

test/sync-history.test.ts (1)

271-371: good deterministic coverage for estimate lifecycle paths.

test/sync-history.test.ts:271-371 is solid vitest coverage for the new append-estimate behavior and reset semantics.

lib/codex-manager.ts (2)

1286-1286: good redaction hardening on failure output.

lib/codex-manager.ts:1286 (Line 1286) and lib/codex-manager.ts:4679 (Line 4679) now normalize and redact filesystem error details before surfacing them. this is the right direction for safe cli diagnostics.

Also applies to: 4679-4681

4466-4497: tests exist for first-run wizard gating and EBUSY transient fallback, but verify via behavior rather than direct variable assertions.

tests at test/codex-manager-cli.test.ts:930 and test/codex-manager-cli.test.ts:1253 verify that the first-run wizard is shown once when starting from missing storage and remains open when OpenCode import probing hits EBUSY (wizard called twice, second showing after failure). test at test/codex-manager-cli.test.ts:1448 verifies the recovery menu appears after first-run wizard creates accounts.

the implementation at lib/codex-manager.ts:4466-4497 uses skipEmptyStorageRecoveryMenu and firstRunWizardShownInLoop to control these flows, and existing tests validate the behavior end-to-end through mock sequencing (e.g., wizardSpy.toHaveBeenCalledTimes(2) after EBUSY) rather than asserting on variable state directly. this approach covers the transient EBUSY scenarios referenced in the coding guidelines and verifies recovery paths remain unbroken.

lib/sync-history.ts

greptile-apps · 2026-03-18T18:23:25Z

lib/sync-history.ts

+			historyEntryCountEstimate = (await loadHistoryEntriesFromDisk(paths)).length;
+		}


unretried count-estimation read drops entries on Windows EBUSY

the loadHistoryEntriesFromDisk call for count estimation is not wrapped in any retry logic. withRetryableHistoryWrite guards all the write paths (appendFile, writeFile, rm), but the preceding read that seeds historyEntryCountEstimate when it is null (e.g. right after configureSyncHistoryForTests) has no equivalent protection. on Windows, antivirus/backup software can return EBUSY or EPERM on a readFile call to a newly active file, which causes:

historyEntryCountEstimate stays null

the exception propagates out of the lock before appendFile is ever reached

the calling entry is silently not written to disk

historyEntryCountEstimate remains null on the next call, which re-tries the read – the previously missed entry is gone for good

withRetryableHistoryWrite works for writes; a withRetryableHistoryRead equivalent (or at minimum a small inline retry loop with the same RETRYABLE_WRITE_CODES set) should wrap this call. there is no vitest case that mocks fs.readFile to throw EBUSY during the initial estimation phase and then asserts the entry is still appended.

Prompt To Fix With AI

This is a comment left during a code review. Path: lib/sync-history.ts Line: 321-322 Comment: **unretried count-estimation read drops entries on Windows EBUSY** the `loadHistoryEntriesFromDisk` call for count estimation is not wrapped in any retry logic. `withRetryableHistoryWrite` guards all the write paths (appendFile, writeFile, rm), but the preceding read that seeds `historyEntryCountEstimate` when it is `null` (e.g. right after `configureSyncHistoryForTests`) has no equivalent protection. on Windows, antivirus/backup software can return `EBUSY` or `EPERM` on a `readFile` call to a newly active file, which causes: 1. `historyEntryCountEstimate` stays `null` 2. the exception propagates out of the lock before `appendFile` is ever reached 3. the calling entry is silently not written to disk 4. `historyEntryCountEstimate` remains `null` on the next call, which re-tries the read – the previously missed entry is gone for good `withRetryableHistoryWrite` works for writes; a `withRetryableHistoryRead` equivalent (or at minimum a small inline retry loop with the same `RETRYABLE_WRITE_CODES` set) should wrap this call. there is no vitest case that mocks `fs.readFile` to throw `EBUSY` during the initial estimation phase and then asserts the entry is still appended. How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-03-18T18:23:25Z

lib/sync-history.ts

+export function pruneSyncHistoryEntries(
+	entries: SyncHistoryEntry[],
+	maxEntries: number = MAX_HISTORY_ENTRIES,
+): PrunedSyncHistory {
+	if (entries.length === 0) {
+		return { entries: [], removed: 0, latest: null };
+	}
+
+	const boundedMaxEntries = Math.max(0, maxEntries);
+	const latestByKind = new Map<SyncHistoryKind, SyncHistoryEntry>();
+	for (let index = entries.length - 1; index >= 0; index -= 1) {
+		const entry = entries[index];
+		if (!entry || latestByKind.has(entry.kind)) {
+			continue;
+		}
+		latestByKind.set(entry.kind, entry);
+	}
+


seenEntries reference-equality check is effectively dead code

seenEntries is built with new Set<SyncHistoryEntry>() and used via seenEntries.has(entry). every entry in the entries array comes from parseHistoryContent → parseEntry(line) → cloneEntry(parsed), so each is a freshly allocated object with a unique reference. seenEntries.has(entry) therefore never returns true during the loop iteration, and seenEntries never prevents any entry from being re-added.

the practical consequence is that if a crash-retry path appended the same JSON line twice to the NDJSON file, pruneSyncHistoryEntries would not deduplicate them – both copies survive the prune. if content-based deduplication is the goal, the set should key on a serialized form (e.g. JSON.stringify(entry)). if the only intent is to guard against the same object reference appearing twice in the array (impossible with parsed results), the seenEntries machinery can be removed entirely to avoid misleading future readers.

Prompt To Fix With AI

This is a comment left during a code review. Path: lib/sync-history.ts Line: 224-241 Comment: **`seenEntries` reference-equality check is effectively dead code** `seenEntries` is built with `new Set<SyncHistoryEntry>()` and used via `seenEntries.has(entry)`. every entry in the `entries` array comes from `parseHistoryContent` → `parseEntry(line)` → `cloneEntry(parsed)`, so each is a freshly allocated object with a unique reference. `seenEntries.has(entry)` therefore never returns `true` during the loop iteration, and `seenEntries` never prevents any entry from being re-added. the practical consequence is that if a crash-retry path appended the same JSON line twice to the NDJSON file, `pruneSyncHistoryEntries` would not deduplicate them – both copies survive the prune. if content-based deduplication is the goal, the set should key on a serialized form (e.g. `JSON.stringify(entry)`). if the only intent is to guard against the same *object reference* appearing twice in the array (impossible with parsed results), the `seenEntries` machinery can be removed entirely to avoid misleading future readers. How can I resolve this? If you propose a fix, please make it concise.

ndycode · 2026-03-19T12:50:58Z

Superseded by clean replacement #129.

This branch was rebuilt from the fresh restack audit onto clean/pr102-opencode-import-replay-final so reviewers can look at the intended first-run wizard slice only. The replacement keeps the wizard behavior, includes the successful OpenCode-import handoff fix, and carries the broader validation run (npm run lint, npm run typecheck, npm run build, npm test, plus focused wizard/storage/sync/menu packs). Closing this bloated 43-file PR in favor of the clean 9-file replacement.

ndycode added 27 commits March 15, 2026 23:19

fix(auth): define reset and delete safety flows

af7f137

feat(auth): add backup restore manager

f48ee06

feat(auth): prompt for recovery on startup

12074e2

feat(sync): add sync center and status surface

eeecf6e

fix(auth): clean startup recovery prompt branch

ccb06f6

feat(settings): productize stable and advanced controls

d0a80b5

fix(auth): drop recovery fallback from settings slice

310325d

docs(auth): align front-door flows with shipped workflows

01ee35b

feat(sync): add result history and last-result log

44bb909

fix(sync): bound sync history recovery reads

d57449f

feat(auth): add backup browser and rotating restore support

56dac78

feat(auth): add restore preview

17cec7e

feat(auth): add restore preview

5ddee4d

feat(auth): snapshot before destructive actions

105730c

feat(sync): add manual rollback

d262177

feat(auth): add retention and recovery doctor

46efc1e

feat(ui): add health summary dashboard

1264cc1

fix(ui): harden dashboard health summary

6decea7

feat(auth): add explicit opencode import adapter

4de04bb

feat(auth): surface opencode import flow

85ee121

feat(auth): surface opencode import flow

91d2ce0

docs(auth): generalize companion import wording

cad31c4

fix(auth): align opencode import replay with current contracts

f1bcc56

feat(auth): add first-run setup wizard

ac2be34

test(auth): cover first-run wizard branches

16ba251

feat(auth): finalize first-run wizard flow

64a8ead

fix(auth): align first-run wizard recovery flow

f5b74dc

Copilot AI review requested due to automatic review settings March 16, 2026 08:47

greptile-apps bot reviewed Mar 16, 2026

View reviewed changes

lib/codex-manager.ts Show resolved Hide resolved

ndycode and others added 13 commits March 18, 2026 09:13

fix(sync): serialize rollback restore

01ba63d

feat(auth): add explicit opencode import adapter

a292205

feat(auth): surface opencode import flow

b099f06

feat(auth): surface opencode import flow

90583a0

docs(auth): generalize companion import wording

a961070

fix(auth): align opencode import replay with current contracts

238003e

fix(auth): keep OpenCode import failures in login loop

694a19c

fix(storage): harden opencode import assessment

c6f71d7

fix(auth): harden opencode import recovery

a31f19b

fix(auth): harden OpenCode import source handling

9567e47

fix(auth): clarify OpenCode import limit failures

e71d34e

test(auth): cover OpenCode import docs and detection

39d6e2f

test(auth): cover HOME fallback for OpenCode detection

3b1ba04

ndycode force-pushed the fresh/18-import-adapter-opencode branch from 5bd459c to 3b1ba04 Compare March 18, 2026 01:19

ndycode added 2 commits March 18, 2026 10:10

Merge remote-tracking branch 'origin/fresh/18-import-adapter-opencode…

0d47e39

…' into work/pr103-remediate-20260318 # Conflicts: # docs/getting-started.md # lib/codex-manager.ts # lib/ui/copy.ts # test/codex-manager-cli.test.ts # test/storage.test.ts

test(auth): align empty health summary expectation

ebb527f

greptile-apps bot reviewed Mar 18, 2026

View reviewed changes

lib/codex-manager.ts Show resolved Hide resolved

lib/codex-manager.ts Outdated Show resolved Hide resolved

coderabbitai bot requested changes Mar 18, 2026

View reviewed changes

fix(wizard): separate OpenCode assessment failures

1527132

greptile-apps bot reviewed Mar 18, 2026

View reviewed changes

lib/sync-history.ts Outdated Show resolved Hide resolved

fix(sync): reset trim estimate after cleared history reload

4eed3b0

greptile-apps bot reviewed Mar 18, 2026

View reviewed changes

lib/codex-manager.ts Outdated Show resolved Hide resolved

fix(wizard): guard doctor panel failures

42a9197

coderabbitai bot requested changes Mar 18, 2026

View reviewed changes

lib/sync-history.ts Outdated Show resolved Hide resolved

ndycode changed the base branch from fresh/18-import-adapter-opencode to dev March 18, 2026 06:47

fix(sync): retry sync-history production writes

edafe3a

greptile-apps bot reviewed Mar 18, 2026

View reviewed changes

ndycode closed this Mar 19, 2026

ndycode deleted the fresh/19-first-run-wizard branch March 19, 2026 16:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(auth): add first-run setup wizard#103

feat(auth): add first-run setup wizard#103
ndycode wants to merge 67 commits intodevfrom
fresh/19-first-run-wizard

ndycode commented Mar 16, 2026 •

edited by greptile-apps bot

Loading

Uh oh!

chatgpt-codex-connector bot commented Mar 16, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

coderabbitai bot Mar 18, 2026

Uh oh!

coderabbitai bot Mar 18, 2026

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

greptile-apps bot Mar 18, 2026

Uh oh!

greptile-apps bot Mar 18, 2026

Uh oh!

ndycode commented Mar 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

-function flattenMockCallArgs(call: unknown[]): string {
-	return call
-		.map((arg) =>
-			arg instanceof Error ? `${arg.name}: ${arg.message}` : String(arg),
-		)
-		.join(" ");
-}
+function serializeMockArg(arg: unknown): string {
+	if (arg instanceof Error) {
+		return `${arg.name}: ${arg.message}`;
+	}
+	if (typeof arg === "string") {
+		return arg;
+	}
+	try {
+		return JSON.stringify(arg);
+	} catch {
+		return String(arg);
+	}
+}
+function flattenMockCallArgs(call: unknown[]): string {
+	return call
+		.map((arg) => serializeMockArg(arg))
+		.join(" ");
+}

		historyEntryCountEstimate = (await loadHistoryEntriesFromDisk(paths)).length;
		}

Conversation

ndycode commented Mar 16, 2026 • edited by greptile-apps bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What Changed

Validation

Docs and Governance Checklist

Risk and Rollback

Additional Notes

note: greptile review for oc-chatgpt-multi-auth. cite files like lib/foo.ts:123. confirm regression tests + windows concurrency/token redaction coverage.

Greptile Summary

Confidence Score: 3/5

Important Files Changed

Sequence Diagram

Uh oh!

chatgpt-codex-connector bot commented Mar 16, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

greptile-apps bot Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

ndycode commented Mar 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ndycode commented Mar 16, 2026 •

edited by greptile-apps bot

Loading

note: greptile review for oc-chatgpt-multi-auth. cite files like `lib/foo.ts:123`. confirm regression tests + windows concurrency/token redaction coverage.