Multi-GPU Reduction & MatMul #1

Open

Labels

enhancement, good first issue, help wanted

Opened on Aug 19, 2024

The current kernels are designed for single-GPU execution. Let's scale them to multi-GPU systems, ideally using TMA and cooperative groups.
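As a starting point for discussion, here is a minimal host-only sketch of the data decomposition a multi-GPU reduction would likely need: split the input into one contiguous shard per device, reduce each shard independently, then combine the partial results. It is plain C++ with no CUDA calls, so it says nothing about TMA or cooperative groups. The names `shard_t`, `make_shards`, and `sharded_sum` are hypothetical and not part of the existing code.

```cpp
#include <cassert>
#include <cstddef>
#include <numeric>
#include <vector>

// Half-open range [begin, end) of input elements owned by one device.
struct shard_t {
    std::size_t begin;
    std::size_t end;
};

// Split `count` elements into `devices` contiguous shards whose sizes differ by at most one.
inline std::vector<shard_t> make_shards(std::size_t count, std::size_t devices) {
    assert(devices > 0);
    std::vector<shard_t> shards(devices);
    std::size_t const base = count / devices;  // every shard gets at least this many
    std::size_t const extra = count % devices; // the first `extra` shards get one more
    std::size_t offset = 0;
    for (std::size_t i = 0; i < devices; ++i) {
        std::size_t const length = base + (i < extra ? 1 : 0);
        shards[i] = {offset, offset + length};
        offset += length;
    }
    return shards;
}

// Sum `data` by reducing each shard separately and then combining the partials.
// Assumption: on real hardware each loop iteration would be an independent
// per-device kernel launch; here it runs sequentially on the host.
inline double sharded_sum(std::vector<float> const &data, std::size_t devices) {
    std::vector<shard_t> const shards = make_shards(data.size(), devices);
    std::vector<double> partials(devices, 0.0);
    for (std::size_t i = 0; i < devices; ++i) {
        auto const first = data.begin() + static_cast<std::ptrdiff_t>(shards[i].begin);
        auto const last = data.begin() + static_cast<std::ptrdiff_t>(shards[i].end);
        partials[i] = std::accumulate(first, last, 0.0);
    }
    // Final combine step: one value per device, cheap enough to do on the host.
    return std::accumulate(partials.begin(), partials.end(), 0.0);
}
```

Calling `sharded_sum(data, device_count)` with the number of visible GPUs gives the same total as a single-device sum, up to floating-point reordering. The same shard boundaries could also drive a row-wise split of the left operand in a matmul, though that is not shown here.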
