
[webgpu] Set is_channels_last to true by default in ComputeMatMul#27674

Merged
guschmue merged 7 commits into microsoft:main from Jiawei-Shao:use-optional-matmul on Apr 9, 2026

Conversation

@Jiawei-Shao (Contributor) commented Mar 16, 2026

This patch sets `is_channels_last` to true by default in the parameter of `ComputeMatMul` and ignores it in `UseSplitK` when there is no `bias`.
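The defaulting behavior can be sketched in isolation. The helper below is hypothetical (the real `ComputeMatMul` takes tensors and a compute context); it only illustrates how a `std::optional<bool>` parameter lets bias-less callers omit the flag while the helper falls back to channels-last:

```cpp
#include <cassert>
#include <optional>

// Hypothetical stand-in for the parameter handling in ComputeMatMul:
// callers without a bias omit the argument, and the helper falls back
// to channels-last (the new default) instead of forcing every call
// site to pick a value.
bool ResolveChannelsLast(std::optional<bool> is_channels_last = std::nullopt) {
  return is_channels_last.value_or(true);  // "true by default"
}
```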

@guschmue guschmue added the ep:WebGPU ort-web webgpu provider label Mar 16, 2026
@Jiawei-Shao (Contributor, Author) commented:

@qjia7 PTAL, thanks!


Copilot AI left a comment


Pull request overview

This PR updates the WebGPU MatMul/Conv Split-K plumbing to treat is_channels_last as an optional signal (only meaningful when bias is present), reducing ambiguity for bias-less call sites and enabling the Split-K MatMul path.

Changes:

  • Change is_channels_last parameters to std::optional<bool> across WebGPU MatMul helpers and Split-K configuration.
  • Update shader-generation helpers (MatMulWriteFnSourceForMatMul) to accept std::optional<bool> and std::string_view.
  • Adjust WebGPU Conv and WebGPU BERT Attention call sites to only pass is_channels_last when bias is used.
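The call-site pattern in the last bullet above can be sketched as follows; `ChannelsLastArg` is an illustrative helper, not an ORT function:

```cpp
#include <cassert>
#include <optional>

// Illustrative helper (not part of ONNX Runtime): Conv and Attention
// call sites forward the layout flag only when a bias tensor
// participates in the MatMul; otherwise they pass std::nullopt.
std::optional<bool> ChannelsLastArg(bool has_bias, bool is_channels_last) {
  if (has_bias) {
    return is_channels_last;  // bias present: the layout matters
  }
  return std::nullopt;  // no bias: leave the flag disengaged
}
```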

Reviewed changes

Copilot reviewed 11 out of 13 changed files in this pull request and generated 3 comments.

Summary per file:

  • onnxruntime/core/providers/webgpu/webgpu_utils.h — Updates the Split-K config API to accept an optional `is_channels_last`.
  • onnxruntime/core/providers/webgpu/webgpu_utils.cc — Updates the Split-K gating logic to consult `is_channels_last` only when it is provided.
  • onnxruntime/core/providers/webgpu/vendor/intel/math/matmul.h — Simplifies the Intel MatMul subgroup program interface (bias removed).
  • onnxruntime/core/providers/webgpu/vendor/intel/math/matmul.cc — Updates Intel subgroup shader generation for the new MatMul write-helper signature.
  • onnxruntime/core/providers/webgpu/nn/conv.cc — Passes `is_channels_last` only when the Conv MatMul path includes a bias.
  • onnxruntime/core/providers/webgpu/math/matmul_packed.h — Updates the MatMul program API to carry an optional `is_channels_last`.
  • onnxruntime/core/providers/webgpu/math/matmul_packed.cc — Treats bias presence as `is_channels_last_.has_value()` and threads the optional into shader generation.
  • onnxruntime/core/providers/webgpu/math/matmul.h — Makes `ComputeMatMul` accept an optional `is_channels_last` with a default of `{}`.
  • onnxruntime/core/providers/webgpu/math/matmul.cc — Enforces consistency between bias presence and `is_channels_last` engagement; wires the optional into Split-K selection and MatMul program creation.
  • onnxruntime/core/providers/webgpu/math/gemm_utils.h — Updates the MatMul write-helper signature to an optional `is_channels_last` and `std::string_view`.
  • onnxruntime/core/providers/webgpu/math/gemm_utils.cc — Implements optional-aware bias handling in the MatMul write helper.
  • onnxruntime/core/providers/webgpu/math/gemm_packed.cc — Updates the Split-K selection call site to pass `std::nullopt` for `is_channels_last`.
  • onnxruntime/contrib_ops/webgpu/bert/attention.cc — Builds the MatMul input list conditionally and passes `is_channels_last` only with a bias.
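The Split-K gating described for `webgpu_utils.cc` and `matmul.cc` can be sketched like this. `SplitKEligible` and the channels-last requirement are assumptions for illustration only; the real `UseSplitK` checks additional shape and device conditions:

```cpp
#include <cassert>
#include <optional>

// Sketch of the gating idea: consult is_channels_last only when the
// caller supplied it (i.e. when a bias is present). An unset optional
// no longer blocks the Split-K MatMul path for bias-less call sites.
bool SplitKEligible(bool other_conditions_ok,
                    std::optional<bool> is_channels_last) {
  if (!other_conditions_ok) return false;
  if (!is_channels_last.has_value()) return true;  // no bias: flag ignored
  return *is_channels_last;  // bias present: assume channels-last is required
}
```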


Comment thread onnxruntime/core/providers/webgpu/math/matmul_packed.h Outdated
Comment thread onnxruntime/core/providers/webgpu/math/gemm_utils.h Outdated
Comment thread onnxruntime/core/providers/webgpu/math/matmul.cc
Comment thread onnxruntime/core/providers/webgpu/math/matmul.h Outdated
This patch sets `is_channels_last` to true by default in the parameter
of `ComputeMatMul` and ignores it in `UseSplitK` when there is no
`bias`.
@Jiawei-Shao Jiawei-Shao changed the title from "[webgpu] Always pass is_channels_last with std::optional" to "[webgpu] Set is_channels_last to true by default in ComputeMatMul" on Mar 27, 2026
@Jiawei-Shao Jiawei-Shao requested review from Copilot and qjia7 March 27, 2026 08:29

Copilot AI left a comment


Pull request overview

Copilot reviewed 6 out of 6 changed files in this pull request and generated 2 comments.



Comment thread onnxruntime/core/providers/webgpu/webgpu_utils.cc Outdated
Comment thread onnxruntime/core/providers/webgpu/math/matmul.cc Outdated
Comment thread onnxruntime/core/providers/webgpu/webgpu_utils.cc Outdated
@Jiawei-Shao Jiawei-Shao requested a review from qjia7 March 30, 2026 07:18
Comment thread onnxruntime/core/providers/webgpu/webgpu_utils.h Outdated
@Jiawei-Shao Jiawei-Shao requested a review from qjia7 March 31, 2026 03:17
@Jiawei-Shao (Contributor, Author) commented:

Hi @guschmue, could you take a look at this PR?

@guschmue guschmue enabled auto-merge (squash) April 2, 2026 01:18

guschmue commented Apr 2, 2026

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline

@azure-pipelines

Azure Pipelines successfully started running 4 pipeline(s).

@Jiawei-Shao (Contributor, Author) commented:

The errors are not related to this PR:

The self-hosted runner lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.

@Jiawei-Shao
Contributor Author

The failures on DirectML CI are not related to this PR.


guschmue commented Apr 8, 2026

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline

@azure-pipelines

Azure Pipelines successfully started running 4 pipeline(s).

@guschmue guschmue closed this Apr 8, 2026
auto-merge was automatically disabled April 8, 2026 19:41

Pull request was closed

@guschmue guschmue reopened this Apr 8, 2026
@guschmue guschmue merged commit 9e3614b into microsoft:main Apr 9, 2026
173 of 264 checks passed

Labels

ep:WebGPU ort-web webgpu provider


4 participants