fix(cuda): add missing runtime_utils.h include in CausalSoftmax
#49
+1 −0