
Fix Ulysses SP backward with SDPA #13328

Open

zhtmike wants to merge 2 commits into huggingface:main from zhtmike:fix_sp_backward

Conversation


@zhtmike (Contributor) commented Mar 25, 2026

What does this PR do?

Solves issue #13319.

There are two bugs:

  • grad_out is already in BSHD format, matching the shape of out, so there is no need to permute grad_out a second time.
  • autograd.Function.backward() runs with gradient tracking disabled by default, so it has to be enabled manually for the recomputation (see the sketch after this list).
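To make both fixes concrete, here is a minimal, self-contained sketch of a recompute-based SDPA backward that applies them. This is not the code from this PR: the SDPAFunction name and the exact BSHD/BHSD permutes are illustrative assumptions.

```python
import torch
import torch.nn.functional as F


class SDPAFunction(torch.autograd.Function):
    """Illustrative sketch: SDPA on BSHD inputs with a recomputed backward."""

    @staticmethod
    def forward(ctx, query, key, value):
        # Inputs are BSHD (batch, seq, heads, dim); SDPA expects BHSD.
        q, k, v = (t.permute(0, 2, 1, 3) for t in (query, key, value))
        ctx.save_for_backward(query, key, value)
        out = F.scaled_dot_product_attention(q, k, v)
        return out.permute(0, 2, 1, 3)  # return in BSHD, like the inputs

    @staticmethod
    def backward(ctx, grad_out):
        query, key, value = ctx.saved_tensors
        # Bug 2: grad mode is disabled inside backward(), so the
        # recomputation must run under torch.enable_grad() to build a graph.
        with torch.enable_grad():
            q, k, v = (t.permute(0, 2, 1, 3).detach().requires_grad_(True)
                       for t in (query, key, value))
            out = F.scaled_dot_product_attention(q, k, v).permute(0, 2, 1, 3)
            # Bug 1: grad_out already matches out's BSHD layout,
            # so it must not be permuted a second time.
            gq, gk, gv = torch.autograd.grad(out, (q, k, v), grad_out)
        # The gradients are BHSD (w.r.t. the permuted tensors); restore BSHD.
        return tuple(g.permute(0, 2, 1, 3) for g in (gq, gk, gv))
```

Comparing SDPAFunction.apply(q, k, v) against plain F.scaled_dot_product_attention on BSHD tensors with requires_grad=True exercises both bug paths.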

After the fix, running torchrun --nproc-per-node 2 toy_train.py --enable-sp produces the expected result:

loss=1.351188

This PR also adds test coverage for the backward ops under context parallelism.

Tested with TestQwenImageTransformerContextParallel, TestFluxTransformerContextParallel, and TestFlux2TransformerContextParallel.
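These can be selected with pytest's -k filter, e.g. pytest -k "TestQwenImageTransformerContextParallel or TestFluxTransformerContextParallel or TestFlux2TransformerContextParallel" run from the repository's test directory (the exact test location is an assumption here).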

Fixes #13319

  • Fix Ulysses SP backward with SDPA


Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.


@sayakpaul (Member) left a comment


Ran the tests and they pass as well. Great work!
