Add optional offset arg to quantized_conv1d_nlc and precompute it AOT (#19344) by khazaei · Pull Request #19344 · pytorch/executorch

khazaei · 2026-05-06T20:38:09Z

Summary:

Extends cadence::quantized_conv1d_nlc (per_tensor / per_tensor_out) with
an optional offset tensor that carries the precomputed zero-point
correction term -(sum(W) * X_z) per output channel.

Updates the op schema in functions.yaml / functions_hifi.yaml /
ops_registrations.py to add Tensor? offset=None.
Threads the new offset argument through the generic and HiFi conv1d_nlc
kernels (currently unused by these kernels).
Updates the depthwise conv1d_nlc callers to pass an empty optional.
Extends PrecomputeForQuantizedConvPass to also precompute the offset
for quantized_conv1d_nlc.per_tensor (sum over weight dims [1, 2]) and
adds a unit test for the new path.

Reviewed By: abeakkas

Differential Revision: D103893688

pytorch-bot · 2026-05-06T20:38:13Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19344

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 18 Unrelated Failures

As of commit 8c1a576 with merge base af90130 ():

NEW FAILURE - The following job has failed:

pull / android / run-emulator (gh)
The process '/usr/local/lib/android/sdk/platform-tools/adb' failed with exit code 224

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / test-lora-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-lora-multimethod-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-models-linux (ic4, portable, linux.4xlarge.memory) / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-models-linux (ic4, xnnpack-quantization-delegation, linux.4xlarge.memory) / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-models-linux (mobilebert, portable, linux.2xlarge) / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-models-linux (mobilebert, xnnpack-quantization-delegation, linux.2xlarge) / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-models-linux (phi_4_mini, portable, linux.4xlarge.memory) / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-moshi-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-phi-3-mini-runner-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-sqnr-static-llm-qnn-linux (smollm2_135m) / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-voxtral-realtime-xnnpack-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-vulkan-models-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / unittest / linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / unittest / macos / macos-job (gh) (trunk failure)
[ FAILED ] OpGridSampler2dTest.BatchSizeMismatchDies
pull / unittest / windows / windows-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / unittest-editable / linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / unittest-editable / macos / macos-job (gh) (trunk failure)
[ FAILED ] OpGridSampler2dTest.BatchSizeMismatchDies
pull / unittest-editable / windows / windows-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

meta-codesync · 2026-05-06T20:38:17Z

@khazaei has exported this pull request. If you are a Meta employee, you can view the originating Diff in D103893688.

github-actions · 2026-05-06T20:38:49Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

…pytorch#19344) Summary: Extends `cadence::quantized_conv1d_nlc` (per_tensor / per_tensor_out) with an optional `offset` tensor that carries the precomputed zero-point correction term `-(sum(W) * X_z)` per output channel. - Updates the op schema in functions.yaml / functions_hifi.yaml / ops_registrations.py to add `Tensor? offset=None`. - Threads the new `offset` argument through the generic and HiFi conv1d_nlc kernels (currently unused by these kernels). - Updates the depthwise conv1d_nlc callers to pass an empty optional. - Extends `PrecomputeForQuantizedConvPass` to also precompute the offset for `quantized_conv1d_nlc.per_tensor` (sum over weight dims [1, 2]) and adds a unit test for the new path. Reviewed By: abeakkas Differential Revision: D103893688

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 6, 2026

meta-codesync Bot added fb-exported meta-exported labels May 6, 2026

abeakkas approved these changes May 6, 2026

View reviewed changes

meta-codesync Bot changed the title ~~Add optional offset arg to quantized_conv1d_nlc and precompute it AOT~~ Add optional offset arg to quantized_conv1d_nlc and precompute it AOT (#19344) May 6, 2026

khazaei force-pushed the export-D103893688 branch from 959c7d7 to 1538a0e Compare May 6, 2026 21:54

khazaei force-pushed the export-D103893688 branch from 1538a0e to 64c00b7 Compare May 6, 2026 23:19

khazaei force-pushed the export-D103893688 branch from 64c00b7 to 8c1a576 Compare May 7, 2026 00:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add optional offset arg to quantized_conv1d_nlc and precompute it AOT (#19344)#19344

Add optional offset arg to quantized_conv1d_nlc and precompute it AOT (#19344)#19344
khazaei wants to merge 1 commit intopytorch:mainfrom
khazaei:export-D103893688

khazaei commented May 6, 2026 •

edited by meta-codesync Bot

Loading

Uh oh!

pytorch-bot Bot commented May 6, 2026 •

edited

Loading

Uh oh!

meta-codesync Bot commented May 6, 2026

Uh oh!

github-actions Bot commented May 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

khazaei commented May 6, 2026 • edited by meta-codesync Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19344

❌ 1 New Failure, 18 Unrelated Failures

Uh oh!

meta-codesync Bot commented May 6, 2026

Uh oh!

github-actions Bot commented May 6, 2026

This PR needs a release notes: label

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

khazaei commented May 6, 2026 •

edited by meta-codesync Bot

Loading

pytorch-bot Bot commented May 6, 2026 •

edited

Loading

This PR needs a `release notes:` label