feat: change default file format version to 2.1 by westonpace · Pull Request #6115 · lance-format/lance

westonpace · 2026-03-06T14:04:16Z

No description provided.

github-actions · 2026-03-06T14:06:24Z

PR Review: feat: change default file format version to 2.1

The change is small and well-scoped. A few observations:

Enum ordering and `Ord` derivation (P1 — fragility risk)

LanceFileVersion derives Ord/PartialOrd, so variant declaration order determines comparison results. This PR reorders variants to place V2_1 before Stable (previously Stable was before V2_1). This changes the behavior of ordering-dependent methods like support_add_sub_column (self > &Self::V2_1) and support_remove_sub_column (self <= &Self::V2_1) when called on Stable.

In practice this is currently safe because Stable/Next are always resolved to concrete versions before being stored in manifests, so they should never reach those comparison methods. However, this design is fragile — nothing in the type system prevents calling support_add_sub_column() on an unresolved Stable. Consider either:

Having support_add_sub_column/support_remove_sub_column call .resolve() first, or
Adding a debug assertion that the variant isn't Stable/Next

This would prevent a subtle bug if a future caller forgets to resolve.

`resolve()` → `Self::default()` — good

Tying Stable resolution to Self::default() instead of hardcoding Self::V2_0 is a nice improvement that keeps them in sync automatically.

Test updates look correct

Python test_metadata: relaxing exact buffer assertions to alignment-based checks makes sense since v2.1 uses different encodings with different buffer layouts.
Rust test_insert_nested_subschemas: v2.1 no longer stores the parent struct field ID as a separate column, so removing field 0 from the expected fields/column_indices is correct.

Missing

No migration guide or changelog note for this default change. Bumping the default file format version is user-visible — downstream consumers on older Lance versions may not be able to read files produced with the new default. Worth documenting.

codecov · 2026-03-06T15:00:54Z

Codecov Report

❌ Patch coverage is 98.38710% with 1 line in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
rust/lance-core/src/utils/futures.rs	98.14%	0 Missing and 1 partial ⚠️

📢 Thoughts on this report? Let us know!

westonpace

Mostly minor test fixes but there was one legitimate bug that needed fixed.

westonpace · 2026-03-06T23:41:04Z

python/python/tests/test_file.py

+    assert len(page.buffers) > 0
+    for buffer in page.buffers:
+        assert buffer.position % 64 == 0
+        assert buffer.size > 0


Miniblock means we now have two buffers, even for primitive data.

westonpace · 2026-03-06T23:42:01Z

python/python/tests/test_optimize.py

-    # until we've accumulated 8MiB for a page.
    assert metrics.fragments_removed == 2
-    assert metrics.fragments_added == 4
+    assert metrics.fragments_added > 2


Spent some time tracking this down. It is 8 now instead of 4. It seems when we read in 2.1 the arrays get allocated a buffer with 2MB of capacity even though there is only 1MB of data. Might be worth some further investigation at some point but not critical.

westonpace · 2026-03-06T23:42:17Z

rust/lance-core/src/utils/futures.rs

+/// this fires when the wrapper is dropped — even if the stream was not fully
+/// consumed.
+#[pin_project(PinnedDrop)]
+pub struct OnDropStream<S: Stream, F: FnOnce()> {


Extra utility to add on_drop method

westonpace · 2026-03-06T23:44:26Z

rust/lance-encoding/src/decoder.rs

-    stream.chain(check_scheduler).boxed()
+    stream
+        .chain(check_scheduler)
+        .on_drop(move || {


This was a bit tricky. There was no initialization needed in 2.0 and so we never noticed this. Bascially...the I/O queue fills up and blocks, the decode thread can't get around to decoding and draining the backpressure queue because the stream has been dropped. So the scheduler hangs. This is expected. When we drop everything we should cancel all the I/O and cleanup.

However, we don't drop everything because we have a reference to the I/O scheduler here. So the fix here aborts the scheduler thread if the stream is dropped.

…orted scan. Fix tests

github-actions bot added enhancement New feature or request python labels Mar 6, 2026

github-actions bot added the java label Mar 6, 2026

westonpace commented Mar 6, 2026

View reviewed changes

westonpace added 5 commits March 16, 2026 06:59

Change default file format version to 2.1

fd55ddf

Fix failing java tests

7d5296b

WIP

dbd5171

Fix bug where scheduler task was keeping i/o scheduler alive after ab…

3e5429c

…orted scan. Fix tests

WIP

332c367

westonpace force-pushed the feat/default-file-version-2-1 branch from 9532ca2 to 332c367 Compare March 16, 2026 13:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: change default file format version to 2.1#6115

feat: change default file format version to 2.1#6115
westonpace wants to merge 5 commits intolance-format:mainfrom
westonpace:feat/default-file-version-2-1

westonpace commented Mar 6, 2026

Uh oh!

github-actions bot commented Mar 6, 2026

Uh oh!

codecov bot commented Mar 6, 2026 •

edited

Loading

Uh oh!

westonpace left a comment

Uh oh!

westonpace Mar 6, 2026

Uh oh!

westonpace Mar 6, 2026

Uh oh!

westonpace Mar 6, 2026

Uh oh!

westonpace Mar 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

westonpace commented Mar 6, 2026

Uh oh!

github-actions bot commented Mar 6, 2026

PR Review: feat: change default file format version to 2.1

Enum ordering and Ord derivation (P1 — fragility risk)

resolve() → Self::default() — good

Test updates look correct

Missing

Uh oh!

codecov bot commented Mar 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

westonpace left a comment

Choose a reason for hiding this comment

Uh oh!

westonpace Mar 6, 2026

Choose a reason for hiding this comment

Uh oh!

westonpace Mar 6, 2026

Choose a reason for hiding this comment

Uh oh!

westonpace Mar 6, 2026

Choose a reason for hiding this comment

Uh oh!

westonpace Mar 6, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Enum ordering and `Ord` derivation (P1 — fragility risk)

`resolve()` → `Self::default()` — good

codecov bot commented Mar 6, 2026 •

edited

Loading