Skip to content

[pull] main from pytorch:main#1088

Merged
pull[bot] merged 1 commit intoMPACT-ORG:mainfrom
pytorch:main
Apr 18, 2026
Merged

[pull] main from pytorch:main#1088
pull[bot] merged 1 commit intoMPACT-ORG:mainfrom
pytorch:main

Conversation

@pull
Copy link
Copy Markdown

@pull pull bot commented Apr 18, 2026

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.4)

Can you help keep this open source service alive? 💖 Please sponsor : )

…utotuner.run() (#179424) (#179424)

Summary:

Refactors the tail of `CachingAutotuner.run()` to separate launch
instrumentation from the core kernel dispatch:

- `_debug_call` is promoted from a local variable in `run()` to an
  instance attribute (initialized to None in __init__), so that
  `_post_launch` can access it without it being passed through the call.

- `_pre_launch(launcher, *args, stream, **kwargs)`: extracts param
  dumping (`dump_launch_params`), tensor dumping (`dump_launch_tensors`),
  and profiler context-manager setup. Returns either a
  `_RecordFunctionFast` context or `nullcontext`.

- `_post_launch()`: extracts the debug-mode finalization
  (`debug_call.finalize`).

- `run()` is simplified to: call `_pre_launch`, enter its context
  manager around `launcher(...)`, call `_post_launch`.

No behavioral change.

See D99545697 for why we'd like to make this change.

Test Plan:
Run triton heuristics unit tests:

  buck test fbcode//mode/opt \
      -m ovr_config//triton:beta \
      -c fbcode.platform010_cuda_version=12.8 \
      fbcode//caffe2/test/inductor:triton_heuristics

Results: Ran 19 tests — 15 passed, 4 skipped, 0 failed

Run the cold-run benchmark suite to verify end-to-end behavior:

  buck run fbcode//mode/opt \
      -m ovr_config//triton:beta \
      -c fbcode.platform010_cuda_version=12.8 \
      fbcode//scripts/nmacchioni/aps-V4_0316tk_128gpu_coldrun_0324-8cae9b57d8:runner \
      -- --times 10 --repeat 1

Results: (total runner duration to be filled after run)

Differential Revision: D99538177

Pull Request resolved: #179424
Approved by: https://github.com/aorenste, https://github.com/jansel
@pull pull bot locked and limited conversation to collaborators Apr 18, 2026
@pull pull bot added the ⤵️ pull label Apr 18, 2026
@pull pull bot merged commit fe2fa45 into MPACT-ORG:main Apr 18, 2026
1 check failed
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant