Split Dataset XML generation by seanmcculloch · Pull Request #157 · AllenNeuralDynamics/Rhapso

seanmcculloch · 2026-02-11T22:51:06Z

Description of the Changes You Made

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

Instructions for Testing

Additional Info

…sform

seanmcculloch · 2026-02-11T23:29:53Z

Verified that the XML output of split_dataset matches the XML from BigStitcher's Virtual Split command.
https://www.diffchecker.com/Jp8Z1JrM/
(Original is BigStitcher, changed is Rhapso.) The only difference is tile size, which is irrelevant for this fix.

seanmcculloch · 2026-02-11T23:30:56Z

Rhapso/data_prep/xml_to_dataframe.py

            timepoint = il.get("timepoint")
-            file_path = il.find("path").text if il.find("path") is not None else None
-            channel = file_path.split("_ch_", 1)[1].split(".ome.zarr", 1)[0]
+            file_path = il.get("path")


Update parse_image_loader_zarr() to properly handle BigStitcher XML conventions, and fall back to a default channel when the channel is not in the filename.
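A minimal sketch of the behavior described in this comment, assuming the image loader entries carry `path` as an XML attribute (as in the `+` line of the hunk) and that `_ch_` may be missing from the file name. The element name `zgroup`, the helper names, the sample XML, and the default channel `"0"` are illustrative assumptions, not taken from the PR.

```python
import xml.etree.ElementTree as ET


def channel_from_path(file_path, default="0"):
    """Return the text between '_ch_' and '.ome.zarr', or `default` if '_ch_' is absent.

    The default value "0" is an assumption made for this sketch.
    """
    if file_path is None or "_ch_" not in file_path:
        return default
    return file_path.split("_ch_", 1)[1].split(".ome.zarr", 1)[0]


def parse_entries(xml_text):
    """Collect one row per <zgroup> element (element name is hypothetical)."""
    root = ET.fromstring(xml_text)
    rows = []
    for il in root.iter("zgroup"):
        file_path = il.get("path")  # attribute lookup instead of a <path> child
        rows.append({
            "timepoint": il.get("timepoint"),
            "file_path": file_path,
            "channel": channel_from_path(file_path),
        })
    return rows


SAMPLE = """
<zgroups>
  <zgroup setup="0" timepoint="0" path="tile_000_ch_488.ome.zarr"/>
  <zgroup setup="1" timepoint="0" path="tile_001.ome.zarr"/>
</zgroups>
"""

rows = parse_entries(SAMPLE)
for row in rows:
    print(row["file_path"], "->", row["channel"])
```

The old one-liner in the hunk would raise an `IndexError` on the second entry, because `split("_ch_", 1)` returns a single-element list when the marker is absent.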

seanmcculloch · 2026-02-11T23:34:27Z

Rhapso/detection/advanced_refinement.py

                if vid == view_id:
-                    to_process_interval = (lb, ub)
+
+                    ub_inclusive = (ub[0]+1, ub[1]+1, ub[2]+1)


The interval needs to use inclusive coordinates for its upper bound so that it is compatible with the block_interval coordinates when calling self.contain().
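To illustrate why a one-pixel mismatch in the upper bound matters, here is a small sketch. It assumes `contain()` compares lower and upper corners component-wise, which the hunk does not show; the `contains` function and all coordinate values below are hypothetical.

```python
def contains(outer, inner):
    """True if `inner` lies fully inside `outer`; both are (lower, upper) corner pairs.

    This is an assumed stand-in for self.contain(), not the PR's implementation.
    """
    (outer_lb, outer_ub), (inner_lb, inner_ub) = outer, inner
    lower_ok = all(o <= i for o, i in zip(outer_lb, inner_lb))
    upper_ok = all(i <= o for i, o in zip(inner_ub, outer_ub))
    return lower_ok and upper_ok


# Hypothetical view bounds and a block whose upper corner is one pixel beyond `ub`.
lb, ub = (0, 0, 0), (63, 63, 63)
block_interval = ((0, 0, 0), (64, 64, 64))

# Same adjustment as the '+' line in the hunk: shift every upper coordinate by one.
ub_shifted = tuple(u + 1 for u in ub)

print(contains((lb, ub), block_interval))          # mismatched conventions
print(contains((lb, ub_shifted), block_interval))  # after the shift
```

With mismatched conventions the block that exactly covers the view is rejected at the boundary; after the shift both intervals describe the same extent and the check passes.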

seanmcculloch · 2026-02-11T23:36:09Z

Rhapso/split_dataset/compute_grid_rules.py

        return int(a + b - (a % b))

-    def find_min_step_size(self):
+    def find_min_step_size(self, lowest_resolution=(1.0, 1.0, 1.0)):


Update this to allow blocks to be created at 1.0 pixel resolution (previously limited to block sizes of powers of 64).
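A rough sketch of how a per-dimension minimum step size could constrain block sizes. Only the rounding expression `int(a + b - (a % b))` and the `lowest_resolution=(1.0, 1.0, 1.0)` default come from the hunk; the early return for exact multiples, the body of `find_min_step_size`, and `snap_block_size` are assumptions made for illustration.

```python
def closest_larger_divisible_by(a, b):
    """Smallest multiple of b that is >= a (b must be positive).

    The exact-multiple shortcut is an assumption; the hunk only shows the
    final rounding expression.
    """
    if a % b == 0:
        return int(a)
    return int(a + b - (a % b))


def find_min_step_size(lowest_resolution=(1.0, 1.0, 1.0)):
    """Hypothetical body: one integer step per dimension, never below 1 pixel."""
    return tuple(max(1, int(round(r))) for r in lowest_resolution)


def snap_block_size(target_size, min_step_size):
    """Round each target dimension up to a multiple of the step (hypothetical helper)."""
    return tuple(
        closest_larger_divisible_by(t, s) for t, s in zip(target_size, min_step_size)
    )


target = (1000, 1000, 500)
print(snap_block_size(target, find_min_step_size()))                  # (1000, 1000, 500)
print(snap_block_size(target, find_min_step_size((64.0, 64.0, 64.0))))  # (1024, 1024, 512)
```

With a step of 1 pixel any requested block size is admissible unchanged, while a coarser step forces sizes up to the next multiple.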

seanmcculloch · 2026-02-11T23:36:44Z

Rhapso/split_dataset/save_xml.py

-            outer_timepoints = ET.Element('Timepoints', {'type': 'pattern'})
-            ip = ET.SubElement(outer_timepoints, 'integerpattern')
-            ip.text = "0"
+            tps = sorted({int(v['old_view'][0]) for v in self.self_definition if v['old_view'][0] is not None})


Determine timepoints integer range from pre-split views
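A sketch of deriving the timepoint range from the pre-split views, reusing the set comprehension from the `+` line. It assumes each entry's `old_view` is a `(timepoint, setup)` pair and that the result is written as a `Timepoints` element of type `range` with `first` and `last` children; the fallback to timepoint 0 when nothing is found and the function name are also assumptions.

```python
import xml.etree.ElementTree as ET


def build_timepoints(self_definition):
    """Build a <Timepoints type="range"> element from pre-split view ids."""
    tps = sorted(
        {int(v["old_view"][0]) for v in self_definition if v["old_view"][0] is not None}
    )
    if not tps:
        tps = [0]  # assumed fallback, mirroring the previously hard-coded "0"

    el = ET.Element("Timepoints", {"type": "range"})
    ET.SubElement(el, "first").text = str(tps[0])
    ET.SubElement(el, "last").text = str(tps[-1])
    return el


# Hypothetical view definitions: (timepoint, setup) pairs, one with a missing timepoint.
views = [{"old_view": (0, 3)}, {"old_view": ("2", 4)}, {"old_view": (None, 5)}]
timepoints = build_timepoints(views)
print(ET.tostring(timepoints, encoding="unicode"))
```

Note that a `range` element only records the first and last timepoint, so gaps in between are not represented by this sketch.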

seanmcculloch · 2026-02-11T23:36:57Z

Rhapso/split_dataset/split_images.py


 class SplitImages:
-    def __init__(self, target_image_size, target_overlap, min_step_size, data_gloabl, n5_path, point_density, min_points, max_points, 
+    def __init__(self, target_image_size, target_overlap, min_step_size, data_global, n5_path, point_density, min_points, max_points, 


fix typo in data_global

seanmcculloch · 2026-02-11T23:38:13Z

Rhapso/split_dataset/split_images.py

-        if size < 0:
-            size = l + size
-        return size
+    def last_image_size(self, L, S, O):


Rewrite of this function to properly determine the size of the last tile; the previous logic was incorrect. Perfect uniform tiling has been verified.
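The rewritten body is not visible in this hunk, so the following is only one way to compute the last tile's size, assuming tiles of size `S` start every `S - O` pixels along a dimension of length `L`. The function name and the validation are assumptions; this is not the PR's implementation.

```python
import math


def last_tile_size(L, S, O):
    """Size of the final tile when tiles of size S with overlap O cover length L."""
    if not 0 <= O < S:
        raise ValueError("overlap must satisfy 0 <= O < S")
    if L <= S:
        return L  # a single tile covers the whole dimension

    stride = S - O
    # Number of tiles needed so that the last one reaches the end of the dimension.
    n_tiles = math.ceil((L - O) / stride)
    last_start = (n_tiles - 1) * stride
    return L - last_start


print(last_tile_size(100, 40, 10))  # 40: tiles [0,40), [30,70), [60,100) are uniform
print(last_tile_size(95, 40, 10))   # 35: the last tile is shorter
```

Under this scheme the result always exceeds `O` and never exceeds `S`, and it equals `S` exactly when the tiling is perfectly uniform.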

seanmcculloch · 2026-02-11T23:38:50Z

Rhapso/split_dataset/split_images.py


            if length <= self.target_image_size[i]:
-                pass
+                dim_intervals.append((0, length - 1))


If the entire dataset is smaller than the block size, still make one block, using the size of the dataset in this dimension.
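A short sketch of per-dimension interval generation with the single-block case from the `+` line, using inclusive `(start, end)` coordinates. The function name and the stride-based loop for the general case are assumptions; only the `(0, length - 1)` branch mirrors the hunk.

```python
def split_dimension(length, target_size, overlap):
    """Return inclusive (start, end) intervals covering [0, length - 1]."""
    if not 0 <= overlap < target_size:
        raise ValueError("overlap must satisfy 0 <= overlap < target_size")

    if length <= target_size:
        # Dataset smaller than one block: emit a single block of the dataset's size.
        return [(0, length - 1)]

    stride = target_size - overlap
    intervals = []
    start = 0
    while True:
        end = min(start + target_size, length) - 1
        intervals.append((start, end))
        if end == length - 1:
            break
        start += stride
    return intervals


print(split_dimension(50, 64, 8))    # [(0, 49)]
print(split_dimension(100, 40, 10))  # [(0, 39), (30, 69), (60, 99)]
```

Before this change the small-dataset branch was a `pass`, which would leave `dim_intervals` empty for that dimension.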

seanmcculloch · 2026-02-11T23:39:59Z

Rhapso/split_dataset/split_images.py

-                    for j in range(i):
-                        other_interval = intervals[j]
-                        intersection = self.intersect(interval, other_interval)
+                    new_v_ip_l = []


The large diff starting here changes the handling of IP detections when splitting:

Adds support for not creating any fake interest points when splitting.

Adds support for virtual splitting of tiles when no IPs have been detected.
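The two cases above can be sketched as follows, assuming each split tile simply receives the existing interest points that fall inside its inclusive interval, translated to tile-local coordinates, with no synthetic points added. The function name and data layout are hypothetical and much simpler than the actual diff.

```python
def assign_points(points, intervals):
    """Map each split-tile index to the interest points inside its interval.

    `points` is a list of (x, y, z) tuples; `intervals` is a list of
    (lower, upper) inclusive corner pairs. No fake points are generated, so a
    tile may legitimately end up with an empty list.
    """
    assigned = {}
    for idx, (lb, ub) in enumerate(intervals):
        assigned[idx] = [
            tuple(p[d] - lb[d] for d in range(3))  # shift into tile-local coordinates
            for p in points
            if all(lb[d] <= p[d] <= ub[d] for d in range(3))
        ]
    return assigned


intervals = [((0, 0, 0), (63, 63, 63)), ((48, 0, 0), (111, 63, 63))]

# A view with detections: each point goes to every tile that contains it.
with_ips = assign_points([(5, 5, 5), (70, 5, 5)], intervals)
print(with_ips)

# A view with no detections: splitting still yields tiles, each with no points.
without_ips = assign_points([], intervals)
print(without_ips)
```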

seanmcculloch · 2026-02-11T23:40:47Z

Rhapso/split_dataset/xml_to_dataframe_split.py

-            file_path = il.find("path").text if il.find("path") is not None else None
-
-            channel = file_path.split("_ch_", 1)[1].split(".ome.zarr", 1)[0]
+            timepoint = il.get("tp") or il.get("timepoint")


Update xml_to_dataframe_split.parse_image_loader_zarr() to match changes to data_prep.xml_to_dataframe.parse_image_loader_zarr()
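The attribute fallback on the `+` line can be exercised in isolation. The element name `zgroup` and the sample attribute values are assumptions for this sketch; only the `il.get("tp") or il.get("timepoint")` expression comes from the hunk.

```python
import xml.etree.ElementTree as ET


def get_timepoint(il):
    """Prefer the 'tp' attribute and fall back to 'timepoint' when it is missing."""
    return il.get("tp") or il.get("timepoint")


short_form = ET.fromstring('<zgroup setup="0" tp="3" path="a.ome.zarr"/>')
long_form = ET.fromstring('<zgroup setup="0" timepoint="5" path="b.ome.zarr"/>')
neither = ET.fromstring('<zgroup setup="0" path="c.ome.zarr"/>')

print(get_timepoint(short_form))  # 3
print(get_timepoint(long_form))   # 5
print(get_timepoint(neither))     # None
```

Attribute values are strings, so `tp="0"` is truthy and is returned as-is; only a missing or empty `tp` falls through to `timepoint`.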

seanmcculloch · 2026-02-23T22:13:22Z

Test fusion by running split, then calling fusion to confirm that fusion works.

Run the Rhapso pipeline all the way through.

Run on exaSPIM through to split affine.
Then run that output on the BigStitcher capsule for fusion.

seanmcculloch added 5 commits February 3, 2026 00:51

split r2r

914d424

detection dev

e66d4bf

fix: split dataset xml generation fix timepoints and calibration tran…

d8367a4

…sform

chore: clean diff

afa6965

fix: update split xml_to_dataframe with better zarr parser

cca9cd9

seanmcculloch requested a review from seanfite-alleninstitute February 11, 2026 23:41

seanmcculloch marked this pull request as ready for review February 11, 2026 23:41

Merge remote-tracking branch 'origin/main' into split_dataset_r2r

ada3df0

seanmcculloch wants to merge 6 commits into main from split_dataset_r2r
