fix: review-agent duplicate bot responses due to missing API pagination by bryan-cox · Pull Request #74690 · openshift/release

bryan-cox · 2026-02-10T15:01:31Z

Summary

The review-agent periodic CI job (runs every 3 hours) is posting duplicate responses to PR comments. On openshift/hypershift#7561, the bot posted 8 nearly identical responses to the same question between Feb 9-10. This PR fixes three root causes.

Problem

Empirical evidence from PR #7561

Metric	Value
Total issue comments	38 (spans 2 API pages)
Without `--paginate`	30 returned (GitHub default page size)
Bot comments visible on page 1	7 of 15
Bot comments on page 2 (invisible)	8 — all the Feb 9-10 duplicates
Most recent bot comment on page 1	Feb 6 (`2026-02-06T08:24:15Z`)
Actual most recent bot comment	Feb 10 (`2026-02-10T14:09:33Z`)

One human comment on Feb 9 at 18:23 UTC triggered 8 duplicate bot responses across 5 periodic runs:

#	Timestamp	Gap from previous
1	`2026-02-09T20:10:57Z`	— (first response)
2	`2026-02-09T23:07:36Z`	~3h (cross-run)
3	`2026-02-09T23:09:12Z`	96s (within-run)
4	`2026-02-10T08:07:05Z`	~9h (cross-run)
5	`2026-02-10T08:07:37Z`	32s (within-run)
6	`2026-02-10T11:06:45Z`	~3h (cross-run)
7	`2026-02-10T11:07:56Z`	71s (within-run)
8	`2026-02-10T14:09:33Z`	~3h (cross-run)

Two distinct patterns are visible: cross-run duplicates (~3h apart, matching the periodic schedule) and within-run duplicates (30-96s apart, within a single session).

Root cause 1: Missing `--paginate` (cross-run duplicates)

comment_analyzer.py fetches issue comments without --paginate in two places (analyze_review_bodies and analyze_issue_comments). GitHub's REST API returns max 30 results per page by default. On PRs with 30+ comments, only page 1 (oldest comments) is returned. Bot replies accumulate at the end of the comment list, landing on page 2+.

The cascade: Analyzer misses bot replies on page 2 → last_bot_time is stale/None → all recent human comments flagged as needing attention → Claude re-responds → more comments push bot replies further onto page 2 → problem worsens each run.

Note: check_replied.py (the dedup script used by the address-reviews command) already uses --paginate correctly (line 158). The analyzer just missed it.

Root cause 2: `check_replied.py` path not reliably found in CI (within-run duplicates)

CLAUDE_PLUGIN_ROOT is exported in the shell script (line 12), but Claude Code's Bash tool initializes a new shell from the user's profile, not from the parent process environment. If the env var isn't inherited, the path resolution falls back to searching ~/.claude/plugins (doesn't exist in CI), and Claude may proceed without the dedup check.

Root cause 3: API propagation delay (within-run duplicates)

When Claude posts reply A and immediately checks for existing replies before posting reply B, GitHub's API may not have propagated reply A yet. This explains the 30-96 second paired duplicates within a single run.

Changes

Single file: hypershift-review-agent-process-commands.sh

Add --paginate to both gh api calls in comment_analyzer.py (analyze_review_bodies and analyze_issue_comments) so all comments across all pages are fetched
Add explicit check_replied.py path in REVIEW_CONTEXT so Claude uses the script directly instead of relying on CLAUDE_PLUGIN_ROOT env var inheritance
Add within-session file-based dedup tracking instructions in REVIEW_CONTEXT to guard against API propagation delays between replies

Test plan

Verify --paginate now returns all 38 comments for PR Fix statusreconciler deployment to be valid #7561 (vs 30 without it)
Verify last_bot_time correctly shows Feb 10 timestamp instead of Feb 6
Confirm PRs with <30 comments still work correctly (pagination is a no-op for single-page results)
Monitor next 2-3 periodic runs on PR Fix statusreconciler deployment to be valid #7561 for absence of new duplicate responses
Verify /tmp/ai-helpers/plugins/utils/scripts/check_replied.py exists after the clone step in CI

🤖 Generated with Claude Code

The review-agent periodic job (every 3 hours) posted 8 nearly identical responses to the same question on PR openshift/hypershift#7561 between Feb 9-10. Root cause: comment_analyzer.py fetches issue comments without --paginate, so GitHub returns only the first 30 (default page size). Bot replies land at the end of the comment list, making them invisible to the analyzer on PRs with 30+ comments. Empirical evidence from PR openshift#7561: - Total comments: 38 (spans 2 pages) - Without --paginate: 30 returned, 7 bot comments visible - With --paginate: 38 returned, all 15 bot comments visible - The 8 bot comments from Feb 9-10 are ALL on page 2 - Analyzer saw last bot reply as Feb 6 instead of Feb 10 - Every human comment since Feb 6 was re-flagged each run The cascade: page 1 misses recent bot replies → last_bot_time is stale → all newer human comments flagged → Claude re-responds → more comments push bot replies further onto page 2 → problem worsens. Within-run duplicates (30-70s apart) are caused by two additional issues: (1) check_replied.py path resolution relies on CLAUDE_PLUGIN_ROOT env var which may not be inherited by Claude's Bash tool in CI, and (2) GitHub API propagation delay means a just-posted reply may not be visible when checking for duplicates before the next reply. Changes: - Add --paginate to both gh api calls in comment_analyzer.py (analyze_review_bodies and analyze_issue_comments) - Add explicit check_replied.py path in REVIEW_CONTEXT so Claude doesn't depend on CLAUDE_PLUGIN_ROOT inheritance - Add within-session file-based dedup tracking to guard against API propagation delays Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

openshift-ci-robot · 2026-02-10T15:04:03Z

[REHEARSALNOTIFIER]
@bryan-cox: the pj-rehearse plugin accommodates running rehearsal tests for the changes in this PR. Expand 'Interacting with pj-rehearse' for usage details. The following rehearsable tests have been affected by this change:

Test name	Repo	Type	Reason
pull-ci-openshift-hypershift-main-review-agent-single-pr	openshift/hypershift	presubmit	Registry content changed
pull-ci-openshift-hypershift-release-4.23-review-agent-single-pr	openshift/hypershift	presubmit	Registry content changed
pull-ci-openshift-hypershift-release-4.22-review-agent-single-pr	openshift/hypershift	presubmit	Registry content changed
periodic-ci-openshift-hypershift-main-periodic-review-agent	N/A	periodic	Registry content changed
periodic-ci-openshift-hypershift-release-4.22-periodic-review-agent	N/A	periodic	Registry content changed

Prior to this PR being merged, you will need to either run and acknowledge or opt to skip these rehearsals.

Interacting with pj-rehearse

Comment: /pj-rehearse to run up to 5 rehearsals
Comment: /pj-rehearse skip to opt-out of rehearsals
Comment: /pj-rehearse {test-name}, with each test separated by a space, to run one or more specific rehearsals
Comment: /pj-rehearse more to run up to 10 rehearsals
Comment: /pj-rehearse max to run up to 25 rehearsals
Comment: /pj-rehearse auto-ack to run up to 5 rehearsals, and add the rehearsals-ack label on success
Comment: /pj-rehearse list to get an up-to-date list of affected jobs
Comment: /pj-rehearse abort to abort all active rehearsals
Comment: /pj-rehearse network-access-allowed to allow rehearsals of tests that have the restrict_network_access field set to false. This must be executed by an openshift org member who is not the PR author

Once you are satisfied with the results of the rehearsals, comment: /pj-rehearse ack to unblock merge. When the rehearsals-ack label is present on your PR, merge will no longer be blocked by rehearsals.
If you would like the rehearsals-ack label removed, comment: /pj-rehearse reject to re-block merging.

bryan-cox · 2026-02-10T15:16:00Z

/pj-rehearse periodic-ci-openshift-hypershift-main-periodic-review-agent

openshift-ci-robot · 2026-02-10T15:16:04Z

@bryan-cox: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

openshift-ci · 2026-02-10T15:27:56Z

@bryan-cox: all tests passed!

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

muraee · 2026-02-11T16:49:23Z

/lgtm

bryan-cox · 2026-02-11T16:49:53Z

/pj-rehearse ack

openshift-ci-robot · 2026-02-11T16:49:55Z

@bryan-cox: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

openshift-ci · 2026-02-11T16:53:39Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: bryan-cox, muraee

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details

Needs approval from an approver in each of these files:

~~ci-operator/step-registry/hypershift/review-agent/process/OWNERS~~ [bryan-cox]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

…openshift#74690) The review-agent periodic job (every 3 hours) posted 8 nearly identical responses to the same question on PR openshift/hypershift#7561 between Feb 9-10. Root cause: comment_analyzer.py fetches issue comments without --paginate, so GitHub returns only the first 30 (default page size). Bot replies land at the end of the comment list, making them invisible to the analyzer on PRs with 30+ comments. Empirical evidence from PR openshift#7561: - Total comments: 38 (spans 2 pages) - Without --paginate: 30 returned, 7 bot comments visible - With --paginate: 38 returned, all 15 bot comments visible - The 8 bot comments from Feb 9-10 are ALL on page 2 - Analyzer saw last bot reply as Feb 6 instead of Feb 10 - Every human comment since Feb 6 was re-flagged each run The cascade: page 1 misses recent bot replies → last_bot_time is stale → all newer human comments flagged → Claude re-responds → more comments push bot replies further onto page 2 → problem worsens. Within-run duplicates (30-70s apart) are caused by two additional issues: (1) check_replied.py path resolution relies on CLAUDE_PLUGIN_ROOT env var which may not be inherited by Claude's Bash tool in CI, and (2) GitHub API propagation delay means a just-posted reply may not be visible when checking for duplicates before the next reply. Changes: - Add --paginate to both gh api calls in comment_analyzer.py (analyze_review_bodies and analyze_issue_comments) - Add explicit check_replied.py path in REVIEW_CONTEXT so Claude doesn't depend on CLAUDE_PLUGIN_ROOT inheritance - Add within-session file-based dedup tracking to guard against API propagation delays Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

openshift-ci Bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Feb 10, 2026

openshift-ci Bot requested review from csrwng and enxebre February 10, 2026 15:02

openshift-ci-robot added the rehearsals-ack Signifies that rehearsal jobs have been acknowledged label Feb 11, 2026

openshift-ci Bot assigned muraee Feb 11, 2026

openshift-ci Bot added the lgtm Indicates that a PR is ready to be merged. label Feb 11, 2026

openshift-merge-bot Bot merged commit 3dbe019 into openshift:main Feb 11, 2026
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: review-agent duplicate bot responses due to missing API pagination#74690

fix: review-agent duplicate bot responses due to missing API pagination#74690
openshift-merge-bot[bot] merged 1 commit intoopenshift:mainfrom
bryan-cox:fix/review-agent-duplicate-bot-comments

bryan-cox commented Feb 10, 2026

Uh oh!

openshift-ci-robot commented Feb 10, 2026

Uh oh!

bryan-cox commented Feb 10, 2026

Uh oh!

openshift-ci-robot commented Feb 10, 2026

Uh oh!

openshift-ci Bot commented Feb 10, 2026

Uh oh!

muraee commented Feb 11, 2026

Uh oh!

bryan-cox commented Feb 11, 2026

Uh oh!

openshift-ci-robot commented Feb 11, 2026

Uh oh!

openshift-ci Bot commented Feb 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

bryan-cox commented Feb 10, 2026

Summary

Problem

Empirical evidence from PR #7561

Root cause 1: Missing --paginate (cross-run duplicates)

Root cause 2: check_replied.py path not reliably found in CI (within-run duplicates)

Root cause 3: API propagation delay (within-run duplicates)

Changes

Test plan

Uh oh!

openshift-ci-robot commented Feb 10, 2026

Uh oh!

bryan-cox commented Feb 10, 2026

Uh oh!

openshift-ci-robot commented Feb 10, 2026

Uh oh!

openshift-ci Bot commented Feb 10, 2026

Uh oh!

muraee commented Feb 11, 2026

Uh oh!

bryan-cox commented Feb 11, 2026

Uh oh!

openshift-ci-robot commented Feb 11, 2026

Uh oh!

openshift-ci Bot commented Feb 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Root cause 1: Missing `--paginate` (cross-run duplicates)

Root cause 2: `check_replied.py` path not reliably found in CI (within-run duplicates)