OCPEDGE-2011: test(two-node): stablize tnf recovery suite #30716
|
@jaypoulz: This pull request references OCPEDGE-2011, which is a valid Jira issue. |
|
/payload-job periodic-ci-openshift-release-master-nightly-4.22-e2e-metal-ovn-two-node-fencing-recovery-techpreview |
|
@eggfoobar: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/b98661b0-f7d2-11f0-965f-3eb360421fed-0 |
|
/test verify Checking if fluke |
fca0530 to
447b67f
Compare
|
Scheduling required tests: |
|
/payload-job pull-ci-openshift-release-master-nightly-4.22-e2e-metal-ovn-two-node-fencing-recovery-techpreview |
|
@eggfoobar: trigger 0 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command |
|
/payload-job periodic-ci-openshift-release-master-nightly-4.22-e2e-metal-ovn-two-node-fencing-recovery-techpreview |
|
@eggfoobar: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/6edd8510-f7e8-11f0-9c54-2a9b6caa1fe3-0 |
clobrano
left a comment
I just left one comment, otherwise lgtm
|
/hold |
b2eb821 to
cd9d035
Compare
|
/payload-job periodic-ci-openshift-release-master-nightly-4.22-e2e-metal-ovn-two-node-fencing-recovery-techpreview |
|
@jaypoulz: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/590785b0-fac2-11f0-93aa-392383318691-0 |
clobrano
left a comment
Great rework, I like the idea to track skipped tests!
/lgtm
cd9d035 to
c387d04
Compare
|
New changes are detected. LGTM label has been removed. |
|
/payload-job periodic-ci-openshift-release-master-nightly-4.22-e2e-metal-ovn-two-node-fencing-recovery-techpreview |
|
@jaypoulz: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/87b2a9b0-fd1c-11f0-9c7d-84e7536a74e3-0 |
|
/payload-job periodic-ci-openshift-release-master-nightly-4.22-e2e-metal-ovn-two-node-fencing-recovery-techpreview |
|
@jaypoulz: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/e451f060-fd33-11f0-8490-88a959329d05-0 |
|
/payload-job periodic-ci-openshift-release-master-nightly-4.22-e2e-metal-ovn-two-node-fencing-recovery-techpreview |
|
@jaypoulz: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/e2c8cfa0-fd44-11f0-9fd5-54fbf52872ce-0 |
70afc49 to
7824a88
Compare
|
/payload-job periodic-ci-openshift-release-master-nightly-4.22-e2e-metal-ovn-two-node-fencing-recovery-techpreview |
|
@eggfoobar: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/847b0a40-fd5c-11f0-80ad-b0c189ec5243-0 |
|
/payload-job periodic-ci-openshift-release-master-nightly-4.22-e2e-metal-ovn-two-node-fencing-recovery-techpreview |
|
@eggfoobar: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/801f2400-fd79-11f0-9a89-5659735b00a6-0 |
f0b86c7 to
1c2bc6b
Compare
|
/payload-job periodic-ci-openshift-release-master-nightly-4.22-e2e-metal-ovn-two-node-fencing-recovery-techpreview |
|
@jaypoulz: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/7694ff00-fe45-11f0-91b0-a115b2c7995c-0 |
1c2bc6b to
0c18169
Compare
|
Scheduling required tests: |
0c18169 to
9908909
Compare
|
/payload-job periodic-ci-openshift-release-master-nightly-4.22-e2e-metal-ovn-two-node-fencing-recovery-techpreview |
|
@jaypoulz: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/a575c0e0-fe4d-11f0-8967-f7abf998ba7b-0 |
DualReplica topology runs etcd externally via Pacemaker/Podman rather than as static pods. Skip pod log streaming when this topology is detected to avoid spurious errors.
Co-authored-by: Claude Opus 4.5 (Anthropic) <noreply@anthropic.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
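The guard described in this commit could look roughly like the sketch below. The helper name `shouldStreamEtcdPodLogs` and the use of plain string topology names are assumptions for illustration; the PR may detect the topology differently.

```go
package main

import "fmt"

// Topology names used for illustration only; the real code may read
// these from the cluster's infrastructure configuration.
const (
	topologyHighlyAvailable = "HighlyAvailable"
	topologyDualReplica     = "DualReplica"
)

// shouldStreamEtcdPodLogs is a hypothetical helper. It reports false for
// DualReplica because etcd runs outside the pod network there (under
// Pacemaker/Podman), so there are no etcd static pods to stream logs from.
func shouldStreamEtcdPodLogs(topology string) bool {
	return topology != topologyDualReplica
}

func main() {
	for _, topo := range []string{topologyHighlyAvailable, topologyDualReplica} {
		if shouldStreamEtcdPodLogs(topo) {
			fmt.Printf("%s: streaming etcd pod logs\n", topo)
			continue
		}
		fmt.Printf("%s: skipping etcd pod log streaming\n", topo)
	}
}
```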
9908909 to
1c79060
Compare
Adds infrastructure to detect and report tests that skip due to unmet cluster preconditions:
- Add precondition skip detection in cmd_runsuite.go that converts skips with "unmet cluster preconditions" marker to synthetic failures
- Handle framework initialization errors gracefully instead of panicking
- Update IsMicroShiftCluster to skip with precondition marker on timeout
- Run two-node suite serially with Disruptive marker
Co-authored-by: Claude Opus 4.5 (Anthropic) <noreply@anthropic.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
Improves two-node cluster test reliability and diagnostics:
- Use stonith confirm for quorum recovery when VM is destroyed
- Wait for CEO update-setup jobs instead of manual pacemaker cycling
- Add diagnostic gathering on test failure (VM states, pcs status, etcd members)
- Add pacemaker cleanup before health checks to clear stale failures
- Add context support to debug container pacemaker functions
- Improve progress logging during recovery validation
- Replace klog with e2e.Logf for consistent ginkgo log capture
- Fix topology detection to bypass stale framework cache
- Reduce verbose logging in libvirt/pacemaker utilities
Co-authored-by: Claude Opus 4.5 (Anthropic) <noreply@anthropic.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
OPM index.json files are JSON arrays, not objects. The jsonformat tool only handles objects, so exclude these files from verification.
Co-authored-by: Claude Opus 4.5 (Anthropic) <noreply@anthropic.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
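To illustrate the array-versus-object distinction behind this exclusion, here is a small standard-library sketch that classifies the top-level JSON value of a file. `topLevelKind` is a hypothetical helper and is not part of the jsonformat tool.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
)

// topLevelKind reports whether data is a JSON object, a JSON array, some
// other valid JSON value, or not valid JSON at all.
func topLevelKind(data []byte) string {
	trimmed := bytes.TrimSpace(data)
	if len(trimmed) == 0 || !json.Valid(trimmed) {
		return "invalid"
	}
	switch trimmed[0] {
	case '{':
		return "object"
	case '[':
		return "array"
	default:
		return "other"
	}
}

func main() {
	samples := map[string]string{
		"config.json": `{"name": "example"}`,
		"index.json":  `[{"schema": "olm.package"}]`,
	}
	for name, body := range samples {
		fmt.Printf("%s: %s\n", name, topLevelKind([]byte(body)))
	}
}
```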
1c79060 to
1f759fb
Compare
|
/payload-job periodic-ci-openshift-release-master-nightly-4.22-e2e-metal-ovn-two-node-fencing-recovery-techpreview |
|
@jaypoulz: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/25a50dc0-fe53-11f0-8776-19560c1b7387-0 |
|
Scheduling required tests: |
|
@jaypoulz: all tests passed! |
Individual recovery tests now skip when cluster preconditions aren't met, but an AfterSuite hook ensures the suite fails with diagnostic information about which tests were skipped and why. This makes precondition failures visible to CI analysis services while maintaining test stability.