Skip to content

Main snap repl#936

Open
Hamdy-khader wants to merge 16 commits intomainfrom
main-snap-repl
Open

Main snap repl#936
Hamdy-khader wants to merge 16 commits intomainfrom
main-snap-repl

Conversation

@Hamdy-khader
Copy link
Copy Markdown
Collaborator

No description provided.

Hamdy-khader and others added 2 commits March 27, 2026 14:13
* wip

* wip 2

* implement snapshot replication

* implement snapshot replication 2

* Fix env_var

* Fix service

* Fix service

* Fix service

* Fix service

* Fix service 2

* Fix service 3

* Fix service 4

* Fix service 6

* Fix service 5

* Fix service 7

* Fix service 8

* Fix service 8

* wip

* wip 2

* wip 3

* wip 3

* wip 4

* wip 4

* wip 5

* wip 7

* Fix lvol poller cpu mask

* Fix target snap name

* fix poller mask

* fix poller mask

* fix poller mask

* fix chain

* fix chain

* fix chain

* fix chain

* fix chain

* fix chain

* Set cluster_id optional on SNodeAPI docker version

* fix type checker

* fix type checker

* Fix snapshot replications tickets

sfam-2497: add snapshot check
sfam-2495: snapshot list --cluster-id
sfam-2498: clone fail

* Fix sfam-2496

* Follow up 1

* fix lvol replication_start

* fix rep service

* fix snapshot clone return value

* replicate snapshot back to src _1 (#790)

* replicate snapshot back to src _1

* fix linter

* Fix sn list apiv2 response _2

* Fix sn list apiv2 response _3

* Add stats to spdk_http_proxy_server.py

Prints max, avg and last 3 sec avg for read lines from http socket and rpc response receive from spdk

* Add stats to spdk_http_proxy_server.py _2

* Fix 2

* Fix 2

* Fix 3

* Fix sfam-2524

Do not cancel snapshot replication task  on node shutdown

* Fix sfam-2523

Show task status to be canceled when replication task status is done and cancel flag is true

* Fix sfam-2527

Fix snapshot chaining

* Increase snapshot replication task retry on node not online

* fix sfam-2516 _1

* fix sfam-2516 _2

* fix sfam-2516 _3

* fix linter

* fix sfam-2516 _4

* wip

* wip

* wip 2

* wip 2

* Exclude src snap node id when starting replication on cloned lvol

* fix snapshot replication source and target in case of replicate_to_source=True

* Main sfam 2359 api (#844)

* added api for snapshot replication

* removed helm chart dep

* fixed Remove assignment to unused variable

* added replication_start and stop to api v2 (#845)

* Enhance snapshot replication logic to support snapshot instances and streamline replication task handling

* Add replication-trigger command to start replication for logical volumes

* fix 1

* fix 1

* Fix 1

* fix typo

* fix rep status return output

* fix: handle missing replicate_as_snap_instance parameter

* fix: use unique UUID for snapshot replication identifier

* fix: improve replication duration calculation logic

* feat: add replicate_lvol_on_target_cluster function and API endpoint

* fix: change replicate_lvol endpoint from GET to POST

* fix: set lvs_name for bdev_lvol in replication process

* wip

* adds lvol clone stack

* fix: update sorting key for snapshots from creation_dt to created_at

* feat: add configuration for MCP and implement device status reset functionality

* set snapshot name when creating lvol no target cluster

* return lvol on target if exists

fix new lvol health check

* fix lvol list

* updated _ReplicationParams field (#847)

* updated _ReplicationParams field

* pool list return uuid intead of id

* lvol list return uuid intead of id

* lvol list return do_replicate

* added service snapshot-replication

* don't fails upon cr patch failure

* added imagepullpolicy

* removed csi configmap and secret from spdk-pod

* update crs name

* updated csi hostpath configuration

* updated csi hostpath configuration

* updated rpc_client logger message

* updated env_var file

* fixed snap param name created_at

* updated snapshotreplications crd

* reverted api v2 field to id from uuid

* updated env_var

* return new lvol connection string on lvol connect if cluster is suspended and lvol is replicated

* feat: add endpoint to list replication tasks for a volume

* updated endpoint and func list_replication_tasks

* update endpoint list_replication_tasks to use instance_api

* updated snapshot replication crd

* feat: add suspend and resume commands for lvol subsystems

* feat: add configuration settings and utility scripts for volume management

* feat: add configuration settings, utility scripts, and endpoints for volume management

* wip

* Adds replicate_lvol_on_source_cluster apiv2

* fix 1

* Adds 'from_source' attr to lvol model

* fix: update suspend and resume functions to return boolean values

* fix: toggle 'from_source' attribute in lvol model during replication

* return from_source from api

* fix: update lvol UUID handling during replication process

* fix: lvol delete on target

* feat: add configuration and utility scripts for managing storage nodes and volumes

* fix: update lvol attributes for cloning and set from_source flag

* refactor replicate_lvol_on_source_cluster

* fix issue

* Revert "fix issue"

This reverts commit 362935a.

* Revert "refactor replicate_lvol_on_source_cluster"

This reverts commit eb05502.

* fix issue

* adds prints

* add function to delete last snapshot if needed after replication

* fix

* fix 2

* fix: handle KeyError in lvol status change and enhance replication function with cluster validation

* avoid Runtime error if lvol not found

* feat: initialize new lvol with unique identifiers and updated naming conventions

* fix: remove unnecessary sleep calls in lvol creation process

* fix: filter out deleted lvols in get_lvols and update lvol deletion process

* fix: update lvol retrieval method to use get_lvols instead of get_all_lvols

* fix: return None instead of False when lvol is not found

* fix: update replicate_lvol_on_source_cluster to include cluster ID

* feat: add configuration for KiroAgent and implement lvol replication with pool UUID

* fix: enhance lvol creation process with target cluster NQN and update source cluster handling

* updated SimplyBlockSnapshotReplication crd field

* fix: handle missing start_time in replication duration calculation

* fix: handle forced deletion of snapshots when storage node is not found

* fix: update KMS init container image to use the correct repository

* fix lvol delete response if lvol is in_deletion

* fix 1

* Add the mount of /mnt/ramdisk to docker deployment

* fix the replicate_lvol_on_source api call params

* fix the replicate_lvol_on_source api call params 2

* fixed init job failed to mkdir /etc/systemd/

* remove init copy script container

* updated storagenode crd

* added storage cr param spdkImage

* fix: update replicate_lvol_on_source_cluster to accept cluster_id and pool_id

* fix: update lvol deletion process to set status and write to database

* Revert "fix: update replicate_lvol_on_source_cluster to accept cluster_id and pool_id"

This reverts commit b6820c5.

* point image to dockerub

* point image to dockerub

* added namespace to api resource

* feat: add clone_lvol function to create snapshots and clones of logical volumes

* feat: implement clone endpoint for logical volumes with retry logic

---------

Co-authored-by: Geoffrey Israel <israelgeoffrey13@gmail.com>
Co-authored-by: wmousa <wmousa27@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants