Which component requires the feature?
CuTe DSL
Feature Request
We're planning a project that involves calling NVSHMEM put/get inside the kernel. Is there a plan for CuTe DSL to support this (maybe through FFI)?
We could write it in CUDA, but CuTe DSL is quite a bit nicer to work with :D