Modular GDN forward / backward kernels (compatible with Kimi CP) #13

@icavan

Description

Implement modular forward and backward CUDA kernels for the GDN (Gated Delta Network) algorithm, compatible with Kimi CP.

Context

GDN is a linear attention variant that combines a gating (decay) term with delta-rule state updates to improve expressiveness. Modular forward/backward kernels compatible with Kimi CP would enable efficient distributed training via context parallelism.
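
For reference, the gated delta rule is usually written as the per-token recurrence below. The symbols are notation chosen for this note, not taken from the issue or from any codebase: $S_t \in \mathbb{R}^{d_v \times d_k}$ is the recurrent state, $\alpha_t \in (0, 1]$ the gate (decay), $\beta_t \in [0, 1]$ the write strength, and $q_t, k_t, v_t$ the query, key, and value at step $t$. The FLA reference may use different layouts or scaling conventions, so treat this as a sketch of the math rather than a spec.

$$
\begin{aligned}
S_t &= \alpha_t \, S_{t-1} \left( I - \beta_t k_t k_t^{\top} \right) + \beta_t v_t k_t^{\top}, \\
o_t &= S_t q_t .
\end{aligned}
$$

Setting $\alpha_t = 1$ gives the ungated delta rule, and setting $\beta_t = 0$ leaves only the decay.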

Tasks

  • Implement GDN forward kernels (modular, similar to the KDA structure)
  • Implement GDN backward kernels
  • Ensure compatibility with Kimi CP's context parallelism protocol
  • Add correctness tests against the FLA reference implementation
  • Add benchmarks
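
As a starting point for the correctness tests, here is a minimal pure-Python sketch of a token-by-token reference for the gated delta rule, plus a check that splitting the sequence into shards and handing the final state from one shard to the next reproduces the unsharded output. All names (`gdn_recurrent`, `gdn_sharded`, `make_inputs`, `max_abs_diff`) are hypothetical and are not part of FLA or Kimi CP. The state handoff is only an assumption about what context parallelism needs from the kernels; it is not Kimi CP's actual protocol, and the shards here run sequentially in one process rather than across devices.

```python
import random


def gdn_recurrent(q, k, v, alpha, beta, state=None):
    """Token-by-token gated delta rule (illustrative reference, not FLA's API).

    q, k: lists of T vectors of size d_k; v: list of T vectors of size d_v;
    alpha, beta: lists of T scalars. state: optional d_v x d_k matrix (list of rows).
    Returns (outputs, final_state).
    """
    d_k, d_v = len(k[0]), len(v[0])
    # Copy the incoming state so the caller's matrix is not mutated.
    S = [[0.0] * d_k for _ in range(d_v)] if state is None else [row[:] for row in state]
    outputs = []
    for q_t, k_t, v_t, a_t, b_t in zip(q, k, v, alpha, beta):
        # Value currently stored for key k_t, after decay: a_t * (S @ k_t).
        pred = [a_t * sum(S[i][j] * k_t[j] for j in range(d_k)) for i in range(d_v)]
        # S <- a_t * S + b_t * (v_t - pred) k_t^T, which expands to
        # S <- a_t * S (I - b_t k_t k_t^T) + b_t v_t k_t^T.
        for i in range(d_v):
            err = b_t * (v_t[i] - pred[i])
            for j in range(d_k):
                S[i][j] = a_t * S[i][j] + err * k_t[j]
        # Read out with the query: o_t = S @ q_t.
        outputs.append([sum(S[i][j] * q_t[j] for j in range(d_k)) for i in range(d_v)])
    return outputs, S


def gdn_sharded(q, k, v, alpha, beta, num_shards):
    """Split the sequence into contiguous shards and pass the state between them.

    Assumption: a CP-compatible kernel accepts an initial state and returns a
    final state. Shards are processed one after another here.
    """
    T = len(q)
    bounds = [T * r // num_shards for r in range(num_shards + 1)]
    outputs, state = [], None
    for lo, hi in zip(bounds, bounds[1:]):
        if lo == hi:  # skip empty shards when num_shards > T
            continue
        out, state = gdn_recurrent(
            q[lo:hi], k[lo:hi], v[lo:hi], alpha[lo:hi], beta[lo:hi], state
        )
        outputs.extend(out)
    return outputs, state


def make_inputs(T, d_k, d_v, seed=0):
    """Deterministic random inputs with unit-norm keys."""
    rng = random.Random(seed)

    def vec(d):
        return [rng.uniform(-1.0, 1.0) for _ in range(d)]

    def unit(x):
        norm = sum(c * c for c in x) ** 0.5
        return [c / norm for c in x]

    q = [vec(d_k) for _ in range(T)]
    k = [unit(vec(d_k)) for _ in range(T)]
    v = [vec(d_v) for _ in range(T)]
    alpha = [rng.uniform(0.9, 1.0) for _ in range(T)]
    beta = [rng.uniform(0.0, 1.0) for _ in range(T)]
    return q, k, v, alpha, beta


def max_abs_diff(a, b):
    """Largest elementwise absolute difference between two lists of vectors."""
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


if __name__ == "__main__":
    q, k, v, alpha, beta = make_inputs(T=16, d_k=4, d_v=3)
    ref, _ = gdn_recurrent(q, k, v, alpha, beta)
    for n in (1, 2, 4, 5):
        out, _ = gdn_sharded(q, k, v, alpha, beta, n)
        assert max_abs_diff(ref, out) < 1e-9, n
```

A real test would replace `gdn_recurrent` with the FLA reference and `gdn_sharded` with the new kernels, compare gradients as well as outputs, and use tolerances suited to the kernel's dtype.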

Metadata

Labels

enhancement, help wanted
