Arm backend: Remove old transform_for_cortex_m_backend and --enable_qdq_fusion_pass by psiddh · Pull Request #17740 · pytorch/executorch

psiddh · 2026-02-26T18:40:02Z

Summary:
Remove the transform_for_cortex_m_backend() function and deprecate --enable_qdq_fusion_pass CLI flag from aot_arm_compiler.py. Instead, ReplaceQuantNodesPass is now applied directly inside to_edge_TOSA_delegate() and to_edge_no_delegate(), making each compilation path self-contained rather than relying on a post-hoc fixup applied to all targets.
This is a prerequisite for PR #17075, which introduces Cortex-M as a first-class compilation target with its own dedicated pipeline.

cc @digantdesai @SS-JIA @freddan80 @per @zingo @oscarandersson8218 @mansnils @Sebastian-Larsson @robell

pytorch-bot · 2026-02-26T18:40:05Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17740

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 4 New Failures, 1 Cancelled Job, 2 Unrelated Failures

As of commit 7d11bdb with merge base e95555a ():

NEW FAILURES - The following jobs have failed:

pull / android / run-emulator (gh)
The process '/usr/local/lib/android/sdk/platform-tools/adb' failed with exit code 224
pull / test-multimodal-linux (gemma3-4b) / linux-job (gh)
RuntimeError: Command docker exec -t 7c0025e95c3cd31b1911571b344cea83beeac79da663a20e52bc2628f1efbe11 /exec failed with exit code 139
pull / unittest / macos / macos-job (gh)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 1
Test Metal Backend / test-model-metal-e2e (mistralai, Voxtral-Mini-4B-Realtime-2602, quantized-int4-metal) / macos-job (gh)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 2

CANCELLED JOB - The following job was cancelled. Please retry:

trunk / test-models-macos-cpu (llama3_2_vision_encoder, portable) / macos-job (gh)
##[error]The operation was canceled.

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Test CUDA Windows Export and E2E / test-model-cuda-windows-e2e (mistralai, Voxtral-Mini-3B-2507, non-quantized) / windows-job (gh) (trunk failure)
Process completed with exit code 1.
Test CUDA Windows Export and E2E / test-model-cuda-windows-e2e (mistralai, Voxtral-Mini-3B-2507, quantized-int4-weight-only) / windows-job (gh) (trunk failure)
Process completed with exit code 1.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-02-26T18:40:49Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Copilot

Pull request overview

Removes the legacy Cortex-M post-processing step from aot_arm_compiler.py and makes the delegation pipeline more explicit by applying remaining Q/DQ cleanup directly in the delegated export flow.

Changes:

Removed transform_for_cortex_m_backend() and the --enable_qdq_fusion_pass CLI flag.
Dropped unused Cortex-M fusion/convert pass imports tied to the removed flag.
Applied ReplaceQuantNodesPass inside to_edge_TOSA_delegate() to handle boundary quantized_decomposed::* ops after lowering.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

examples/arm/aot_arm_compiler.py

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

examples/arm/aot_arm_compiler.py

…pass flag Summary: Remove the transform_for_cortex_m_backend() function and the --enable_qdq_fusion_pass CLI flag from aot_arm_compiler.py. The function applied Cortex-M passes as a post-hoc step to all non-VGF targets, which made the compilation flow hard to follow and coupled the delegation path to Cortex-M-specific logic. Instead, ReplaceQuantNodesPass is now applied directly inside to_edge_TOSA_delegate() to handle any boundary quantized_decomposed::* nodes that remain outside the delegated subgraph. This makes the delegation path self-contained and explicit about its runtime requirements. This change is in preparation for an upcoming PR (#17075) that introduces Cortex-M as a first-class compilation target with its own dedicated pipeline, including CortexMQuantizer and CortexMPassManager.

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 2 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

examples/arm/aot_arm_compiler.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

examples/arm/aot_arm_compiler.py

Copilot · 2026-02-27T04:17:56Z

examples/arm/aot_arm_compiler.py

    parser.add_argument(
        "--non_strict_export",
        dest="strict_export",
        required=False,
        action="store_false",
        help="Disable strict checking while exporting models.",
    )


This PR removes the --enable_qdq_fusion_pass CLI flag, but examples/arm/run.sh still constructs and passes --enable_qdq_fusion_pass (see run.sh around the qdq_fusion_op_flag assignment). That script will now fail with an “unrecognized arguments” error; update the script (and any other callers) to stop using the removed flag.

Will clean it up in follow up PR

If "One way is to keep the flag but it it does nothing more the print that is deprecated?" is done that would be OK (Se my other comment)

But if NOT it will break run.sh args for a while (as the flag can the used from it) so in that case its better to try to fix both in same PR to keep stuff "working" all the time.

Added deprecated text to the flag in this PR,

Updated the _apply_replace_quant_nodes function to accept a generic edge argument instead of EdgeProgramManager.

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated no new comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

examples/arm/aot_arm_compiler.py

Remove deprecated --enable_qdq_fusion_pass argument and related logging.

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

examples/arm/aot_arm_compiler.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 2 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

examples/arm/aot_arm_compiler.py

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated no new comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot AI review requested due to automatic review settings February 26, 2026 18:40

psiddh requested a review from digantdesai as a code owner February 26, 2026 18:40

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 26, 2026

Copilot started reviewing on behalf of psiddh February 26, 2026 18:40 View session

psiddh added the ciflow/trunk label Feb 26, 2026

psiddh mentioned this pull request Feb 26, 2026

Arm backend: Add Cortex-M as a first-class target in aot_arm_compiler #17075

Open

Copilot AI reviewed Feb 26, 2026

View reviewed changes

examples/arm/aot_arm_compiler.py Outdated Show resolved Hide resolved

psiddh force-pushed the aot_pre branch from 44576d9 to 1805ef0 Compare February 26, 2026 20:33

Copilot AI review requested due to automatic review settings February 26, 2026 23:41

Copilot started reviewing on behalf of psiddh February 26, 2026 23:42 View session

Copilot AI reviewed Feb 26, 2026

View reviewed changes

examples/arm/aot_arm_compiler.py Outdated Show resolved Hide resolved

examples/arm/aot_arm_compiler.py Outdated Show resolved Hide resolved

examples/arm/aot_arm_compiler.py Outdated Show resolved Hide resolved

psiddh force-pushed the aot_pre branch from 837ca7c to 7a4d15b Compare February 27, 2026 02:44

psiddh requested review from Copilot and zingo February 27, 2026 04:06

Copilot started reviewing on behalf of psiddh February 27, 2026 04:06 View session

psiddh requested review from AdrianLundell and rascani February 27, 2026 04:06

Copilot AI reviewed Feb 27, 2026

View reviewed changes

examples/arm/aot_arm_compiler.py Outdated Show resolved Hide resolved

examples/arm/aot_arm_compiler.py Show resolved Hide resolved

Fix formatting of docstring in _apply_replace_quant_nodes

d69df60

psiddh force-pushed the aot_pre branch from 877d91a to d69df60 Compare February 27, 2026 04:11

Apply suggestion from @Copilot

6fea499

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot AI review requested due to automatic review settings February 27, 2026 04:13

Apply suggestion from @Copilot

45b4c4c

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot started reviewing on behalf of psiddh February 27, 2026 04:14 View session

Copilot AI reviewed Feb 27, 2026

View reviewed changes

Refactor _apply_replace_quant_nodes function signature

be42e3b

Updated the _apply_replace_quant_nodes function to accept a generic edge argument instead of EdgeProgramManager.

Copilot AI review requested due to automatic review settings February 27, 2026 04:18

Copilot started reviewing on behalf of psiddh February 27, 2026 04:18 View session

Copilot AI reviewed Feb 27, 2026

View reviewed changes

zingo added the partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm label Feb 27, 2026

zingo changed the title ~~Remove legacy transform_for_cortex_m_backend and --enable_qdq_fusion_…~~ Arm backend: Remove legacy transform_for_cortex_m_backend and --enable_qdq_fusion_… Feb 27, 2026

zingo changed the title ~~Arm backend: Remove legacy transform_for_cortex_m_backend and --enable_qdq_fusion_…~~ Arm backend: Remove old transform_for_cortex_m_backend and --enable_qdq_fusion_pass Feb 27, 2026

Merge branch 'main' into aot_pre

13b7ddc

zingo requested changes Feb 27, 2026

View reviewed changes

examples/arm/aot_arm_compiler.py Show resolved Hide resolved

Remove deprecated QDQ fusion pass argument

0bba11c

Remove deprecated --enable_qdq_fusion_pass argument and related logging.

Copilot AI review requested due to automatic review settings February 27, 2026 15:53

Copilot started reviewing on behalf of psiddh February 27, 2026 15:54 View session

Copilot AI reviewed Feb 27, 2026

View reviewed changes

examples/arm/aot_arm_compiler.py Outdated Show resolved Hide resolved

examples/arm/aot_arm_compiler.py Outdated Show resolved Hide resolved

examples/arm/aot_arm_compiler.py Outdated Show resolved Hide resolved

Update examples/arm/aot_arm_compiler.py

18c4295

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot AI review requested due to automatic review settings February 27, 2026 15:58

Copilot started reviewing on behalf of psiddh February 27, 2026 15:59 View session

Copilot AI reviewed Feb 27, 2026

View reviewed changes

examples/arm/aot_arm_compiler.py Outdated Show resolved Hide resolved

examples/arm/aot_arm_compiler.py Show resolved Hide resolved

Clarify comments for quantized node replacements

7d11bdb

Copilot AI review requested due to automatic review settings February 27, 2026 16:05

Copilot started reviewing on behalf of psiddh February 27, 2026 16:06 View session

Copilot AI reviewed Feb 27, 2026

View reviewed changes

Conversation

psiddh commented Feb 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Feb 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17740

❌ 4 New Failures, 1 Cancelled Job, 2 Unrelated Failures

Uh oh!

github-actions bot commented Feb 26, 2026

This PR needs a release notes: label

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Copilot AI Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

psiddh Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

zingo Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

psiddh Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

psiddh commented Feb 26, 2026 •

edited

Loading

pytorch-bot bot commented Feb 26, 2026 •

edited

Loading

This PR needs a `release notes:` label

zingo Feb 27, 2026 •

edited

Loading