[Chore] Minor simplification for non-PP path #24810

WoosukKwon · 2025-09-13T22:04:49Z

Currently, broadcast_pp_output complicates the logic in the common non-PP path.
This PR simplifies the logic a bit.

Signed-off-by: Woosuk Kwon <[email protected]>

gemini-code-assist

Code Review

This pull request provides a nice simplification to the execute_model method in GPUModelRunner. By moving the broadcast_pp_output flag to the __init__ method and restructuring the logic to handle the common (non-broadcast) and rare (broadcast) paths separately, the code becomes much clearer and easier to follow. This refactoring also correctly handles kv_connector_output for pooling models and appears to fix a latent bug where logits were not updated on non-last pipeline parallel ranks after being broadcast. The changes are well-contained and improve both readability and correctness. I have no further suggestions.

njhill · 2025-09-13T22:36:50Z

vllm/v1/worker/gpu_model_runner.py

-                model_output_broadcast_data = {
-                    "logits": logits.contiguous(),
-                } if logits is not None else {}
+            else:


just a thought, could we put the else logic in a different function to be even less intrusive to the common path?

Signed-off-by: Woosuk Kwon <[email protected]>

…to loader * 'loader' of https://github.com/dsxsteven/vllm_splitPR: (123 commits) [Hybrid Allocator] Support Pipeline Parallel (vllm-project#23974) [Spec Decoding]Support Spec Decoding Metrics in DP Mode (vllm-project#24049) [Chore] Remove ipex_ops warning (vllm-project#24835) Force use C++17 globally to avoid compilation error (vllm-project#24823) [Benchmarks] Throw usage error when using dataset-name random and dataset-path together (vllm-project#24819) fix type of sampling rate for encode_base64 (vllm-project#24826) [Perf] Fix DeepGEMM Contiguous Layout Issue, 5.5% Throughput Improvement (vllm-project#24783) [Misc] Improve `s3_utils` type hints with `BaseClient` (vllm-project#24825) [Multi Modal][Performance] Fused Q,K's apply_rope into one (vllm-project#24511) [Chore] Minor simplification for non-PP path (vllm-project#24810) [Minor] Simplify duplicative device check for cuda (vllm-project#24793) Remove redundant assignment in xfer_buffers, This is a little fix (vllm-project#24732) [CI][Spec Decode] Adjust threshold for flaky ngram spec decoding test again (vllm-project#24771) [Doc]: fix typos in various files (vllm-project#24798) [Misc] Correct an outdated comment. (vllm-project#24765) [CI Failure] Fix test_flashinfer_cutlass_mxfp4_mxfp8_fused_moe (vllm-project#24750) [Core][Multimodal] Cache `supports_kw` (vllm-project#24773) [Kernels][DP/EP] Optimize Silu Kernel for R1 (vllm-project#24054) [Perf] Use NVIDIA hardware-accelerated instruction for float to fp8_e4m3 quantization (vllm-project#24757) [Doc]: Remove 404 hyperlinks (vllm-project#24785) ...

Signed-off-by: Woosuk Kwon <[email protected]>

Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: bbartels <[email protected]>

Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: bruceszchen <[email protected]>

[Chore] Minor simplification for PP

97395c8

Signed-off-by: Woosuk Kwon <[email protected]>

WoosukKwon requested review from robertgshaw2-redhat, njhill, ywang96, comaniac and alexm-redhat as code owners September 13, 2025 22:04

mergify bot added the v1 label Sep 13, 2025

WoosukKwon added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 13, 2025

gemini-code-assist bot reviewed Sep 13, 2025

View reviewed changes

njhill approved these changes Sep 13, 2025

View reviewed changes

WoosukKwon merged commit 3e903b6 into main Sep 14, 2025
58 checks passed

WoosukKwon deleted the woosuk/minor-simpl-pool branch September 14, 2025 00:41

BoyuanFeng pushed a commit to BoyuanFeng/vllm that referenced this pull request Sep 14, 2025

[Chore] Minor simplification for non-PP path (vllm-project#24810)

d4946ce

Signed-off-by: Woosuk Kwon <[email protected]>

dsxsteven pushed a commit to dsxsteven/vllm_splitPR that referenced this pull request Sep 15, 2025

[Chore] Minor simplification for non-PP path (vllm-project#24810)

e240c90

Signed-off-by: Woosuk Kwon <[email protected]>

bbartels pushed a commit to bbartels/vllm that referenced this pull request Sep 15, 2025

[Chore] Minor simplification for non-PP path (vllm-project#24810)

47775f1

Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: bbartels <[email protected]>

cboss6 pushed a commit to cboss6/vllm that referenced this pull request Sep 16, 2025

[Chore] Minor simplification for non-PP path (vllm-project#24810)

7cb6f8c

Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: bruceszchen <[email protected]>

cboss6 pushed a commit to cboss6/vllm that referenced this pull request Sep 16, 2025

[Chore] Minor simplification for non-PP path (vllm-project#24810)

9c34ebf

Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: bruceszchen <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Chore] Minor simplification for non-PP path #24810

[Chore] Minor simplification for non-PP path #24810

Uh oh!

WoosukKwon commented Sep 13, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

njhill Sep 13, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[Chore] Minor simplification for non-PP path #24810

[Chore] Minor simplification for non-PP path #24810

Uh oh!

Conversation

WoosukKwon commented Sep 13, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

njhill Sep 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

WoosukKwon commented Sep 13, 2025 •

edited by github-actions bot

Loading

njhill Sep 13, 2025 •

edited

Loading