update Qwen3.5 grpo demo #124
+69
−87
Merged
Starting job
Loading