[ovndbcluster] Allow config changes without pod restarts #549

Merged
openshift-merge-bot[bot] merged 1 commit into openstack-k8s-operators:main from
karelyatin:OSPRH-27985-job
Apr 14, 2026

@karelyatin
Contributor

@karelyatin karelyatin commented Apr 2, 2026

Currently, any config change restarts the pod, which is unnecessary for options such as:

  • ElectionTimer
  • InactivityProbe
  • LogLevel

Deprecated the probeIntervalToActive config option, as it is relevant only
for active/standby DB clusters, which we are not using here.

Also use the DB directories (which are persistent) for OVN_RUNDIR, so that job pods can use them to apply the config changes.

Resolves: #OSPRH-27985
Assisted-By: Claude
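
A minimal sketch of the decision described above, not the operator's actual code: given the old and new option values, restart the pod only if something other than the three runtime-tunable options changed. The camelCase keys `electionTimer` and `inactivityProbe`, the `replicas` key, and both function names are assumptions made for illustration.

```python
# Options the description lists as changeable without a pod restart.
# The exact key spellings are assumed, not taken from the operator API.
RUNTIME_OPTIONS = frozenset({"electionTimer", "inactivityProbe", "logLevel"})


def changed_keys(old: dict, new: dict) -> set:
    """Return the option names whose values differ between old and new."""
    return {k for k in old.keys() | new.keys() if old.get(k) != new.get(k)}


def needs_restart(old: dict, new: dict) -> bool:
    """True if any changed option falls outside the runtime-tunable set."""
    return bool(changed_keys(old, new) - RUNTIME_OPTIONS)


if __name__ == "__main__":
    old = {"logLevel": "info", "electionTimer": 10000, "replicas": 3}
    # Only runtime-tunable options changed: a job could apply them in place.
    print(needs_restart(old, {**old, "logLevel": "dbg"}))
    # A non-runtime option changed: the pod still has to restart.
    print(needs_restart(old, {**old, "replicas": 1}))
```

Keeping the runtime-tunable names in one set makes it cheap to add another option later if it also turns out to be changeable at runtime.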

@openshift-ci openshift-ci bot requested review from abays and slawqo April 2, 2026 16:31
@openshift-ci openshift-ci bot added the approved label Apr 2, 2026
@softwarefactory-project-zuul

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://softwarefactory-project.io/zuul/t/rdoproject.org/buildset/f021bdefc9c34b7f823ff7d1c795d42d

openstack-k8s-operators-content-provider FAILURE in 6m 08s
⚠️ ovn-operator-tempest-multinode SKIPPED Skipped due to failed job openstack-k8s-operators-content-provider

@karelyatin
Contributor Author

recheck trunkrdo

@softwarefactory-project-zuul

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://softwarefactory-project.io/zuul/t/rdoproject.org/buildset/75cf967375fd46b89297dcb96d40bb5b

✔️ openstack-k8s-operators-content-provider SUCCESS in 1h 27m 28s
ovn-operator-tempest-multinode FAILURE in 1h 08m 50s

@karelyatin karelyatin force-pushed the OSPRH-27985-job branch 3 times, most recently from 7af8de6 to 1d5b62d Compare April 7, 2026 14:26
karelyatin added a commit to karelyatin/openstack-must-gather that referenced this pull request Apr 8, 2026
With [1], new paths are used, so we need to use those
and fall back to the older paths for backward compatibility.

[1] openstack-k8s-operators/ovn-operator#549

Related-Issue: #OSPRH-27985
Signed-off-by: Yatin Karel <ykarel@redhat.com>
Comment thread internal/controller/ovndbcluster_controller.go Outdated
Comment thread internal/controller/ovndbcluster_controller.go Outdated
Comment thread internal/controller/ovndbcluster_controller.go
Comment thread internal/ovndbcluster/runtimeconfigjob.go
@averdagu
Contributor

averdagu commented Apr 8, 2026

So far code looks good to me, I'll test this tomorrow

@softwarefactory-project-zuul

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://softwarefactory-project.io/zuul/t/rdoproject.org/buildset/c94c02f79a5c4014815fb1deaf9bf96e

✔️ openstack-k8s-operators-content-provider SUCCESS in 42m 50s
ovn-operator-tempest-multinode FAILURE in 26m 02s

Comment thread internal/ovndbcluster/runtimeconfigjob.go
@karelyatin karelyatin force-pushed the OSPRH-27985-job branch 2 times, most recently from 958bda6 to 152447c Compare April 9, 2026 11:40
@softwarefactory-project-zuul

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://softwarefactory-project.io/zuul/t/rdoproject.org/buildset/8c894ca3175b4071805370bdeb6d6e45

✔️ openstack-k8s-operators-content-provider SUCCESS in 1h 30m 27s
ovn-operator-tempest-multinode FAILURE in 1h 09m 58s

Currently, any config change restarts the pod, which
is unnecessary for options such as:
- ElectionTimer
- InactivityProbe
- LogLevel

Deprecated the probeIntervalToActive config option, as
it is relevant only for active/standby DB clusters,
which we are not using here.

Also use the DB directories (which are persistent) for OVN_RUNDIR,
so that job pods can use them to apply the config changes.

Resolves: #OSPRH-27985
Assisted-By: Claude
Signed-off-by: Yatin Karel <ykarel@redhat.com>
@averdagu
Contributor

averdagu commented Apr 9, 2026

Adding a comment here instead of on the code, as I don't know the best way to handle this, but we need to sanitize the input given by the user.
I tested by setting "logLevel: debug", but ovn-appctl only accepts "dbg".
We need to check the input either on the CR or in the script (though if we do it in the script, it may look as if everything went fine while the log level was not actually changed).
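
A minimal sketch of the kind of input check discussed here, written in Python for brevity and not taken from the operator. The accepted set is the vlog levels used by OVS/OVN (`off`, `emer`, `err`, `warn`, `info`, `dbg`). The alias table and the function name `normalize_log_level` are hypothetical. Raising on unknown input avoids the silent-failure case mentioned above.

```python
# Levels understood by the OVS/OVN vlog facility.
VLOG_LEVELS = ("off", "emer", "err", "warn", "info", "dbg")

# Hypothetical convenience aliases; these are an assumption, not operator API.
ALIASES = {
    "debug": "dbg",
    "error": "err",
    "warning": "warn",
    "emergency": "emer",
}


def normalize_log_level(value: str) -> str:
    """Map user input to a vlog level, or raise ValueError if it is unknown."""
    level = value.strip().lower()
    level = ALIASES.get(level, level)
    if level not in VLOG_LEVELS:
        raise ValueError(
            f"unsupported log level {value!r}; expected one of {', '.join(VLOG_LEVELS)}"
        )
    return level


if __name__ == "__main__":
    print(normalize_log_level("debug"))
    try:
        normalize_log_level("verbose")
    except ValueError as exc:
        print(exc)
```

Rejecting the value at the CR level (for example with an enum-style validation) would surface the error earlier than a check in the job script. Which of the two fits better is left open in this thread.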

@karelyatin
Contributor Author

> Adding a comment here instead of on the code, as I don't know the best way to handle this, but we need to sanitize the input given by the user. I tested by setting "logLevel: debug", but ovn-appctl only accepts "dbg". We need to check the input either on the CR or in the script (though if we do it in the script, it may look as if everything went fine while the log level was not actually changed).

Yes, I noticed that too, but since it was not specific to this change I didn't touch the API for it. So we can handle that as a follow-up.

@averdagu
Contributor

>> Adding a comment here instead of on the code, as I don't know the best way to handle this, but we need to sanitize the input given by the user. I tested by setting "logLevel: debug", but ovn-appctl only accepts "dbg". We need to check the input either on the CR or in the script (though if we do it in the script, it may look as if everything went fine while the log level was not actually changed).

> Yes, I noticed that too, but since it was not specific to this change I didn't touch the API for it. So we can handle that as a follow-up.

For the record, I created an issue for this problem, which I plan to work on :)

@averdagu
Contributor

Looks all good to me

@openshift-ci openshift-ci bot added the lgtm label Apr 14, 2026
@openshift-ci
Contributor

openshift-ci bot commented Apr 14, 2026

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: karelyatin, slawqo

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@karelyatin
Contributor Author

/test ovn-operator-build-deploy-kuttl
must gather fail

@karelyatin
Contributor Author

/test ovn-operator-build-deploy-kuttl
gather failure last

@openshift-merge-bot openshift-merge-bot bot merged commit 81fd12b into openstack-k8s-operators:main Apr 14, 2026
6 checks passed
@karelyatin karelyatin deleted the OSPRH-27985-job branch April 14, 2026 14:44
@karelyatin
Contributor Author

/cherry-pick 18.0-fr5

@openshift-cherrypick-robot

@karelyatin: new pull request created: #555

Details

In response to this:

/cherry-pick 18.0-fr5

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.
