Remove support for prefix matching on bucket names by Sleepful · Pull Request #499 · powersync-ja/powersync-service

Sleepful · 2026-02-11T08:15:28Z

Issue: #400

Summary

The refactor cleanly removes the U+FFFF sentinel, COLLATE "C", and prefix matching from the Postgres compactor, replacing them with incremental bucket discovery and per-bucket exact-match compaction. EXPLAIN plans confirm optimal index usage with no COLLATE and no Filter lines.

Tradeoffs

Incremental discovery vs. single-pass: The new approach adds one small index-only query per 200 buckets. For extreme deployments (millions of buckets), this adds ~5000 round-trips, but each is sub-millisecond. The plan's analysis is sound — the tradeoff is heavily in favor of correctness and simplicity.
Breaking change for CLI users: Users passing bucket definition names (-b global) will now get a validation error instead of silently compacting matching buckets. This is intentional and documented. No existing tests used prefix matching, confirming this was an unused feature path.

Manual validation

CLI warning message

node service/lib/entry.js compact -c ../powersync.yaml -b global
info: Successfully registered Module Core.
error: Invalid bucket names: global. Pass full bucket names (e.g., "global[]"), not bucket definition names (e.g., "global").
error: Recipe `ps` failed on line 7 with exit code 1

Run compact tests

PG_STORAGE_TEST_URL="postgres://postgres:postgres@localhost:5432/powersync_storage_test" pnpm --filter @powersync/service-module-postgres-storage test -- --run

✓ test/src/connection-report-storage.test.ts (14 tests) 36ms

 Test Files  5 passed (5)
      Tests  74 passed (74)
   Start at  01:55:12
   Duration  29.56s (transform 747ms, setup 3.22s, import 3.49s, tests 22.46s, environment 0ms)

Run EXPLAIN

These represent the queries that are used in file

    PGPASSWORD=postgres psql -h localhost -p 5432 -U postgres \
      -d powersync_storage_test -c "SET search_path TO powersync;" \
      -c "SELECT '=== DISCOVERY: incremental bucket enumeration ===' AS query;" \
      -c "EXPLAIN SELECT DISTINCT bucket_name FROM bucket_data WHERE group_id = 1 AND bucket_name > '' ORDER BY bucket_name ASC LIMIT 200;" \
      -c "SELECT '=== SINGLE BUCKET: no cursor ===' AS query;" \
      -c "EXPLAIN SELECT op, op_id, source_table, table_name, row_id, source_key, bucket_name FROM bucket_data WHERE group_id = 1 AND bucket_name = 'global[]' ORDER BY op_id DESC LIMIT 10000;" \
      -c "SELECT '=== SINGLE BUCKET: with cursor ===' AS query;" \
      -c "EXPLAIN SELECT op, op_id, source_table, table_name, row_id, source_key, bucket_name FROM bucket_data WHERE group_id = 1 AND bucket_name = 'global[]' AND op_id < 50000 ORDER BY op_id DESC LIMIT 10000;"

`just explain` query	Source	Lines
`DISCOVERY: incremental bucket enumeration`	`PostgresCompactor.ts:90-98`	`compactAllBuckets()` discovery query: `SELECT DISTINCT bucket_name ... WHERE bucket_name > $lastBucket ORDER BY bucket_name ASC LIMIT 200`
`SINGLE BUCKET: no cursor`	`PostgresCompactor.ts:126-149`	`compactSingleBucket()` on first iteration, when `upperOpIdLimit = BIGINT_MAX` (line 123) — effectively no cursor constraint
`SINGLE BUCKET: with cursor`	`PostgresCompactor.ts:126-149`	Same query on subsequent iterations, after `upperOpIdLimit = lastBatchItem.op_id` (line 158) makes the `AND op_id < $upperOpIdLimit` clause a real constraint

                       query
---------------------------------------------------
 === DISCOVERY: incremental bucket enumeration ===
(1 row)

                                             QUERY PLAN
-----------------------------------------------------------------------------------------------------
 Limit  (cost=0.15..8.17 rows=1 width=32)
   ->  Result  (cost=0.15..8.17 rows=1 width=32)
         ->  Unique  (cost=0.15..8.17 rows=1 width=32)
               ->  Index Only Scan using unique_id on bucket_data  (cost=0.15..8.17 rows=1 width=32)
                     Index Cond: ((group_id = 1) AND (bucket_name > ''::text))
(5 rows)

              query
----------------------------------
 === SINGLE BUCKET: no cursor ===
(1 row)

                                          QUERY PLAN
----------------------------------------------------------------------------------------------
 Limit  (cost=0.15..8.17 rows=1 width=200)
   ->  Index Scan Backward using unique_id on bucket_data  (cost=0.15..8.17 rows=1 width=200)
         Index Cond: ((group_id = 1) AND (bucket_name = 'global[]'::text))
(3 rows)

               query
------------------------------------
 === SINGLE BUCKET: with cursor ===
(1 row)

                                          QUERY PLAN
-----------------------------------------------------------------------------------------------
 Limit  (cost=0.15..8.17 rows=1 width=200)
   ->  Index Scan Backward using unique_id on bucket_data  (cost=0.15..8.17 rows=1 width=200)
         Index Cond: ((group_id = 1) AND (bucket_name = 'global[]'::text) AND (op_id < 50000))
(3 rows)

Skipped validation

I did not run the MongoDB tests, seems like those files were not touched. Let me know if I should run them, or if CI takes care of it. Basically this command: pnpm --filter @powersync/service-module-mongodb-storage test -- --run

CLAassistant · 2026-02-11T08:15:34Z

All committers have signed the CLA.

changeset-bot · 2026-02-11T08:15:35Z

🦋 Changeset detected

Latest commit: 4d3ff2c

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 12 packages

Name	Type
@powersync/service-module-postgres-storage	Patch
@powersync/service-core	Patch
@powersync/service-schema	Patch
@powersync/service-module-mongodb	Patch
@powersync/service-module-mssql	Patch
@powersync/service-module-mysql	Patch
@powersync/service-module-postgres	Patch
@powersync/service-image	Patch
@powersync/service-core-tests	Patch
@powersync/service-module-core	Patch
@powersync/service-module-mongodb-storage	Patch
test-client	Patch

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

rkistner

Thanks!

Sleepful · 2026-02-20T08:19:03Z

Pushed two fixes:

Updated test to use updateSyncRulesFromYaml() interface (aligned with upstream changes)
Ran prettier on the three affected files

Build and format checks pass locally. Ready for re-review.

Sleepful · 2026-02-20T09:03:12Z

Fixed the actual test failure: getBucketDataBatch was returning 0 items because the test used bare bucket names (global[]) but storage version 2 now uses versioned names (1#global[]). Updated the test to use bucketRequest() / bucketRequestMap() helpers. All 92 postgres-storage tests pass locally.

Sleepful · 2026-02-20T09:47:20Z

⚠️ Flaky CI test ⚠️

There was a CI failure in Postgres Test (12) (https://github.com/powersync-ja/powersync-service/actions/runs/22217913667/job/64265997954?pr=499) — looks like a flaky test. It's replicating changing primary key in wal_stream.test.ts, which this PR doesn't touch. Passes on all other PG versions (11, 13–18) and on main.

I re-ran the failed test without any changes and it passed ✅

… fix Adds a test that verifies the compactor properly handles bucket_name comparisons when using U+FFFF as a sentinel upper bound. Under locale-aware collation (e.g. en_US.UTF-8), alphanumeric strings can sort after U+FFFF, causing the initial pagination query to return zero rows and skip compaction entirely. The test inserts two PUT ops for the same row, then compacts without specifying compactBuckets (triggering the bucketUpper = U+FFFF path). After compaction, the older PUT should be converted to MOVE - which only works correctly with COLLATE "C" forcing byte-order comparison.

The COLLATE regression test (Postgres Compact - COLLATE regression) verified that compaction works when bucket_name is compared against the U+FFFF sentinel under locale-aware collation. This test is redundant with the shared compacting tests in register-compacting-tests.ts: - compacting (1) through (4) all exercise the bucket==null (compact-all) path - They verify MOVE conversion with stronger assertions (validateCompactedBucket) - The extra test was a minimal reproduction of the same code path The sentinel and COLLATE "C" will be removed entirely as part of the compactor refactor, making this regression impossible to trigger. The shared tests provide full coverage of the behavior we care about: MOVE conversion works correctly when compacting all buckets.

…at a time Replace complex bucket range filtering in PostgresCompactor with a cleaner two-method approach: compactAllBuckets() discovers buckets in batches and compactSingleBucket() handles individual bucket compaction. Changes: - Split compactInternal into compactAllBuckets and compactSingleBucket - Remove bucket range string matching (bucketLower/bucketUpper) logic - Add CLI validation requiring full bucket names (e.g., "global[]") - Update documentation to clarify compactBuckets format requirements - Add test coverage for explicit bucket name compaction

…erface

Sleepful · 2026-02-20T18:21:19Z

rebased

Sleepful marked this pull request as draft February 11, 2026 08:16

Sleepful force-pushed the jv-2026-02-10-compaction branch from 9d82ddb to a9cfb37 Compare February 11, 2026 08:55

Sleepful marked this pull request as ready for review February 11, 2026 08:56

rkistner previously approved these changes Feb 11, 2026

View reviewed changes

Sleepful dismissed rkistner’s stale review via 010444e February 20, 2026 06:22

Sleepful force-pushed the jv-2026-02-10-compaction branch from 010444e to 0f4b970 Compare February 20, 2026 06:32

Sleepful requested a review from rkistner February 20, 2026 18:13

Sleepful added 7 commits February 20, 2026 12:17

chore: add changeset for Postgres compaction refactor

b291629

fix(postgres-storage): update test to use updateSyncRulesFromYaml int…

e7a0971

…erface

style: format compaction files with prettier

761f242

fix(postgres-storage): use versioned bucket names in compaction test

4d3ff2c

Sleepful force-pushed the jv-2026-02-10-compaction branch from c02c457 to 4d3ff2c Compare February 20, 2026 18:19

rkistner approved these changes Feb 23, 2026

View reviewed changes

Sleepful merged commit 65f3c89 into powersync-ja:main Feb 23, 2026
21 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove support for prefix matching on bucket names#499

Remove support for prefix matching on bucket names#499
Sleepful merged 7 commits intopowersync-ja:mainfrom
Sleepful:jv-2026-02-10-compaction

Sleepful commented Feb 11, 2026 •

edited

Loading

Uh oh!

CLAassistant commented Feb 11, 2026 •

edited

Loading

Uh oh!

changeset-bot Bot commented Feb 11, 2026 •

edited

Loading

Uh oh!

rkistner left a comment

Uh oh!

Sleepful commented Feb 20, 2026

Uh oh!

Sleepful commented Feb 20, 2026

Uh oh!

Sleepful commented Feb 20, 2026

Uh oh!

Sleepful commented Feb 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Sleepful commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Tradeoffs

Manual validation

CLI warning message

Run compact tests

Run EXPLAIN

Skipped validation

Uh oh!

CLAassistant commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

changeset-bot Bot commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🦋 Changeset detected

Uh oh!

rkistner left a comment

Choose a reason for hiding this comment

Uh oh!

Sleepful commented Feb 20, 2026

Uh oh!

Sleepful commented Feb 20, 2026

Uh oh!

Sleepful commented Feb 20, 2026

Uh oh!

Sleepful commented Feb 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Sleepful commented Feb 11, 2026 •

edited

Loading

CLAassistant commented Feb 11, 2026 •

edited

Loading

changeset-bot Bot commented Feb 11, 2026 •

edited

Loading