perf: WAND and BM25 read optimizations#6214
perf: WAND and BM25 read optimizations#6214esteban wants to merge 2 commits intolance-format:mainfrom
Conversation
Optimize the FTS query hot path for lower CPU usage: - Inline BM25 scoring into WAND inner loop, pre-sort postings by df - Phase-split scheduler: async I/O loading then rayon CPU compute - Dedicated rayon thread pool sized to Lance CPU budget - Partition-level stats cache and per-token doc_freq cache
|
ACTION NEEDED The PR title and description are used as the merge commit message. Please update your PR title and description to match the specification. For details on the error please inspect the "PR Title Check" action. |
PR Review: perf: WAND and BM25 read optimizationsThe overall approach is sound — phase-splitting async I/O from CPU compute, caching partition stats, pre-extracting block metadata, and avoiding redundant decompression via P1: Unbounded
|
Codecov Report❌ Patch coverage is 📢 Thoughts on this report? Let us know! |
Optimize the FTS query hot path for lower CPU usage:
Benchmarking is WIP.