
feat: Add merged vLLM rollout weights#631

Open
vivekkalyan wants to merge 5 commits into main from feat/merged-inference

Conversation

@vivekkalyan
Collaborator

@vivekkalyan vivekkalyan commented Mar 25, 2026

Enable ART to serve merged LoRA weights through a dedicated vLLM server so that Qwen3.5-MoE training works on the current vLLM build.
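Here, "merged" means folding the LoRA delta into the base weights before serving, so the inference server sees a single dense weight matrix. A minimal NumPy sketch of the standard LoRA merge formula (illustrative only, not ART's implementation; `merge_lora` and its parameters are assumed names):

```python
import numpy as np

def merge_lora(W: np.ndarray, A: np.ndarray, B: np.ndarray,
               alpha: float, r: int) -> np.ndarray:
    """Fold a LoRA adapter into a base weight matrix.

    W: base weight, shape (out, in)
    A: LoRA down-projection, shape (r, in)
    B: LoRA up-projection, shape (out, r)
    Merged weight: W' = W + (alpha / r) * B @ A
    """
    return W + (alpha / r) * (B @ A)
```

The merged matrix has the same shape as `W`, which is why it can be pushed to vLLM as an ordinary weight update while the unmerged LoRA checkpoint is kept separately for continued training.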

Changes

  • Add rollout_weights_mode: "lora" | "merged" with dedicated-only validation and a merged-mode requirement for Qwen/Qwen3.5-35B-A3B and Qwen/Qwen3.5-397B-A17B
  • Push merged weights into dedicated vLLM with native weight transfer while keeping LoRA checkpoints for training and persistence
  • Update dedicated server wiring, validation, and Qwen3.5 smoke scripts/tests for the merged-inference path
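The validation rules in the first bullet could be sketched roughly as follows; all names here (`RolloutConfig`, `rollout_weights_mode`, `dedicated_vllm`, `MERGED_ONLY_MODELS`) are hypothetical stand-ins, not the actual ART API:

```python
from dataclasses import dataclass
from typing import Literal

# Models that only work with merged weights on the current vLLM build
# (assumed constant name; model IDs are taken from the PR description).
MERGED_ONLY_MODELS = {
    "Qwen/Qwen3.5-35B-A3B",
    "Qwen/Qwen3.5-397B-A17B",
}

@dataclass
class RolloutConfig:
    base_model: str
    rollout_weights_mode: Literal["lora", "merged"] = "lora"
    dedicated_vllm: bool = False

    def validate(self) -> None:
        # Merged-weight serving is dedicated-only: it requires a
        # dedicated vLLM server rather than a shared one.
        if self.rollout_weights_mode == "merged" and not self.dedicated_vllm:
            raise ValueError(
                "rollout_weights_mode='merged' requires a dedicated vLLM server"
            )
        # The Qwen3.5 MoE models must use merged mode.
        if (self.base_model in MERGED_ONLY_MODELS
                and self.rollout_weights_mode != "merged"):
            raise ValueError(
                f"{self.base_model} requires rollout_weights_mode='merged'"
            )
```

A valid configuration for the MoE models would then pair `rollout_weights_mode="merged"` with a dedicated server, while dense models can keep the default LoRA path.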

