fix: correct NVIDIA CUDA queue guard in run_opencl_fft by xywei · Pull Request #271 · inducer/sumpy

xywei · 2026-03-17T16:09:36Z

Closes #270.

Summary

fix an inverted queue check in run_opencl_fft for the NVIDIA CUDA path
raise only when wait_for contains events from a different command queue
keep same-queue events valid so the marker workaround behaves as intended

Why

The previous condition used if not evt.command_queue != queue, which is logically equivalent to evt.command_queue == queue. That raised on the safe same-queue case and failed to reject different queues.

Validation

reproduced downstream failure in volumential on NVIDIA CUDA before the fix (RuntimeError: Different queues not supported with NVIDIA CUDA)
after this one-line change, test/test_volume_fmm.py::test_volume_fmm_laplace passes for NVIDIA CUDA on ipa

inducer · 2026-03-17T18:25:09Z

Thx!

fix: correct NVIDIA queue guard in run_opencl_fft

299aaff

xywei mentioned this pull request Mar 17, 2026

Migrate near-field table cache from HDF5 to SQLite xywei/volumential#18

Merged

inducer merged commit 118ae83 into inducer:main Mar 17, 2026
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: correct NVIDIA CUDA queue guard in run_opencl_fft#271

fix: correct NVIDIA CUDA queue guard in run_opencl_fft#271
inducer merged 1 commit intoinducer:mainfrom
xywei:fix-nvidia-cuda-queue-check

xywei commented Mar 17, 2026

Uh oh!

Uh oh!

inducer commented Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

xywei commented Mar 17, 2026

Summary

Why

Validation

Uh oh!

Uh oh!

inducer commented Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants