Skip to content

Basic tensor support for AMDGPU#341

Open
kshyatt wants to merge 14 commits intomainfrom
ksh/amd
Open

Basic tensor support for AMDGPU#341
kshyatt wants to merge 14 commits intomainfrom
ksh/amd

Conversation

@kshyatt
Copy link
Member

@kshyatt kshyatt commented Jan 5, 2026

Mostly copied from the CUDA support

@kshyatt kshyatt requested review from Jutho and lkdvos January 5, 2026 13:59
@github-actions
Copy link
Contributor

github-actions bot commented Jan 5, 2026

Your PR no longer requires formatting changes. Thank you for your contribution!

lkdvos
lkdvos previously approved these changes Jan 5, 2026
Copy link
Member

@lkdvos lkdvos left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In principle looks good to me!

Do we want to wait with the AMD support until the dust settles on CUDA (mostly to avoid having to duplicate things we might still change), or should we just go ahead with this?

@kshyatt
Copy link
Member Author

kshyatt commented Jan 6, 2026

This doesn't include factorization stuff which is the only inflight CUDA thing, I think? Most of the diff is the tests, tbh

@lkdvos
Copy link
Member

lkdvos commented Jan 6, 2026

The tests is actually what I was thinking of, but maybe it's really not that bad

@kshyatt kshyatt force-pushed the ksh/amd branch 2 times, most recently from 33e01a6 to fa009dc Compare January 21, 2026 12:47
lkdvos
lkdvos previously approved these changes Jan 21, 2026
Copy link
Member

@lkdvos lkdvos left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Overall looks good to me, I would be happy to merge and gradually improve

@test ht2 == TensorKit.to_cpu(dt2)
end

dt3 = AMDGPU.@allowscalar repartition(t, k)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are we tracking these @allowscalar calls somewhere? Technically this test is now not really testing whether or not it works :p

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

At least in the tests we can just search in the file, I can make a tracker comment at the top?

@kshyatt
Copy link
Member Author

kshyatt commented Jan 21, 2026

Let me figure out where the segfaults are happening then I'm also ok to merge

kshyatt and others added 2 commits March 2, 2026 14:38
Set the timeout for AMDGPU to be more generous for now
@codecov
Copy link

codecov bot commented Mar 3, 2026

Codecov Report

❌ Patch coverage is 49.31507% with 37 lines in your changes missing coverage. Please review.

Files with missing lines Patch % Lines
ext/TensorKitAMDGPUExt/roctensormap.jl 48.61% 37 Missing ⚠️
Files with missing lines Coverage Δ
ext/TensorKitAMDGPUExt/TensorKitAMDGPUExt.jl 100.00% <100.00%> (ø)
ext/TensorKitAMDGPUExt/roctensormap.jl 48.61% <48.61%> (ø)
🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants