[TPU][Core] Enable Pipeline Parallelism on TPU backend by Chenyaaang · Pull Request #28506 · vllm-project/vllm

Chenyaaang · 2025-11-12T01:55:15Z

Enable Pipeline Parallelism on TPU backend

This pr includes changes on vLLM side:

multiproc_executor.py: Extract _get_parallel_sizes and _post_init_executor methods, so that TPU (and other hardware) can further override them if necessary.
ray_utils.py: Extract _is_intermediate_tensors for TPU to override (to accept jax's version IntermediateTensor)

The command to enable PP on TPU platform is same as other platforms, but PP hasn't been supported on all Jax models, so add env var MODEL_IMPL_TYPE=vllm to use pytorch impl. PP can be used on single host or multi-host (with Ray).

Example command: MODEL_IMPL_TYPE=vllm TPU_BACKEND_TYPE=jax vllm serve Qwen/Qwen3-32B --pipeline-parallel-size 4

chatgpt-codex-connector · 2025-12-16T19:54:01Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.

yaochengji

Thanks for your contribution! My main concern is that the modification looks a little intrusive to me. I'm wondering if we can add a subclass in the tpu-inference repo.

mergify · 2026-01-06T19:08:24Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @Chenyaaang.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

Chenyaaang · 2026-01-06T20:00:17Z

Thanks for your contribution! My main concern is that the modification looks a little intrusive to me. I'm wondering if we can add a subclass in the tpu-inference repo.

Here's the pr (creating subclass in tpu_inference) vllm-project/tpu-inference#1401

yaochengji · 2026-01-13T22:17:58Z

@mgoin what do you think about the PR? vllm TPU needs to override these 3 methods (_get_parallel_sizes, _is_driver_worker, _is_intermediate_tensors) to support pipeline parallelism

yaochengji · 2026-01-13T22:18:48Z

@Chenyaaang could you please update your description? get_pp_group doesn't exist in the latest version.

Signed-off-by: Chenyaaang <chenyangli@google.com> llama debug Signed-off-by: Chenyaaang <chenyangli@google.com> core debug Signed-off-by: Chenyaaang <chenyangli@google.com> pp for single host Signed-off-by: Chenyaaang <chenyangli@google.com> pp single host Signed-off-by: Chenyaaang <chenyangli@google.com> pp single host comment Signed-off-by: Chenyaaang <chenyangli@google.com> amend single host Signed-off-by: Chenyaaang <chenyangli@google.com> single host Signed-off-by: Chenyaaang <chenyangli@google.com>

Signed-off-by: Chenyaaang <chenyangli@google.com> amend ray Signed-off-by: Chenyaaang <chenyangli@google.com>

Signed-off-by: Chenyaaang <chenyangli@google.com>

…circular import Signed-off-by: Chenyaaang <chenyangli@google.com>

Signed-off-by: Chenyaaang <chenyangli@google.com>

…erride this method Signed-off-by: Chenyaaang <chenyangli@google.com>

Signed-off-by: Chenyaaang <chenyangli@google.com>

yaochengji

LGTM, thanks!

### What this PR does / why we need it? 1. ✅ Upgrade vllm commit to: 0115 (8471b27) Modify import paths due to the refactors： vllm-project/vllm#32245 vllm-project/vllm#32060 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21034239336/job/60490156965?pr=5913 2. ✅Upgrade vllm commit to: 0119 (9a1f16d) Fix `WorkerProc.__init__() missing 1 required positional argument: 'is_driver_worker'` due to vllm-project/vllm#28506 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21156263050/job/60841668755?5569 3. ✅Upgrade vllm commit to: 0120(148117e) 1. Add `skip_compiled` param in `set_forward_context` due to vllm-project/vllm#30385 2. Modify `tests/ut/spec_decode/test_eagle_proposer.py` due to vllm-project/vllm#24322 change `self.max_num_tokens = vllm_config.scheduler_config.max_num_batched_tokens + max_batch_size` 3. Modify UT import paths due to the refactors：vllm-project/vllm#32060 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21204851770/job/60999046946 4. ✅Upgrade vllm commit to: 0121(f23fb5a) 1. vLLM switched `uses_mrope` from target to draft model config, making `positions`/`mrope_positions` mutually exclusive, breaking vllm-ascend's direct self.positions access and tests missing `draft_model_config.uses_mrope`. vllm-project/vllm#32048 2. Moved bs_to_padded_graph_size from CompilationConfig to CudagraphDispatcher due to the refactor vllm-project/vllm#30143 3. Remove unused `maybe_setup_kv_connector` due to vllm-project/vllm#32077 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21217728738/job/61043738834 6. ✅Upgrade vllm commit to: 0122(8ebf271) Updating FusedMoEParallelConfig (added enable_eplb) and FusedMoEConfig due to vllm-project/vllm#32414 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21249922546/job/61148613054 8. ✅Upgrade vllm commit to: 0123(dc917cc) Setting temperature=0.0 due to the removal of the default temperature value in vllm-project/vllm#32723 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21280796875 ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.14.0 - vLLM main: vllm-project/vllm@d682094 --------- Signed-off-by: wjunLu <wjunlu217@gmail.com> Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com> Co-authored-by: wjunLu <wjunlu217@gmail.com>

### What this PR does / why we need it? 1. ✅ Upgrade vllm commit to: 0115 (8471b27) Modify import paths due to the refactors： vllm-project/vllm#32245 vllm-project/vllm#32060 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21034239336/job/60490156965?pr=5913 2. ✅Upgrade vllm commit to: 0119 (9a1f16d) Fix `WorkerProc.__init__() missing 1 required positional argument: 'is_driver_worker'` due to vllm-project/vllm#28506 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21156263050/job/60841668755?5569 3. ✅Upgrade vllm commit to: 0120(148117e) 1. Add `skip_compiled` param in `set_forward_context` due to vllm-project/vllm#30385 2. Modify `tests/ut/spec_decode/test_eagle_proposer.py` due to vllm-project/vllm#24322 change `self.max_num_tokens = vllm_config.scheduler_config.max_num_batched_tokens + max_batch_size` 3. Modify UT import paths due to the refactors：vllm-project/vllm#32060 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21204851770/job/60999046946 4. ✅Upgrade vllm commit to: 0121(f23fb5a) 1. vLLM switched `uses_mrope` from target to draft model config, making `positions`/`mrope_positions` mutually exclusive, breaking vllm-ascend's direct self.positions access and tests missing `draft_model_config.uses_mrope`. vllm-project/vllm#32048 2. Moved bs_to_padded_graph_size from CompilationConfig to CudagraphDispatcher due to the refactor vllm-project/vllm#30143 3. Remove unused `maybe_setup_kv_connector` due to vllm-project/vllm#32077 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21217728738/job/61043738834 6. ✅Upgrade vllm commit to: 0122(8ebf271) Updating FusedMoEParallelConfig (added enable_eplb) and FusedMoEConfig due to vllm-project/vllm#32414 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21249922546/job/61148613054 8. ✅Upgrade vllm commit to: 0123(dc917cc) Setting temperature=0.0 due to the removal of the default temperature value in vllm-project/vllm#32723 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21280796875 ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.14.0 - vLLM main: vllm-project/vllm@d682094 --------- Signed-off-by: wjunLu <wjunlu217@gmail.com> Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com> Co-authored-by: wjunLu <wjunlu217@gmail.com> Signed-off-by: momochenchuw <chenchuw@huawei.com>

### What this PR does / why we need it? 1. ✅ Upgrade vllm commit to: 0115 (8471b27) Modify import paths due to the refactors： vllm-project/vllm#32245 vllm-project/vllm#32060 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21034239336/job/60490156965?pr=5913 2. ✅Upgrade vllm commit to: 0119 (9a1f16d) Fix `WorkerProc.__init__() missing 1 required positional argument: 'is_driver_worker'` due to vllm-project/vllm#28506 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21156263050/job/60841668755?5569 3. ✅Upgrade vllm commit to: 0120(148117e) 1. Add `skip_compiled` param in `set_forward_context` due to vllm-project/vllm#30385 2. Modify `tests/ut/spec_decode/test_eagle_proposer.py` due to vllm-project/vllm#24322 change `self.max_num_tokens = vllm_config.scheduler_config.max_num_batched_tokens + max_batch_size` 3. Modify UT import paths due to the refactors：vllm-project/vllm#32060 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21204851770/job/60999046946 4. ✅Upgrade vllm commit to: 0121(f23fb5a) 1. vLLM switched `uses_mrope` from target to draft model config, making `positions`/`mrope_positions` mutually exclusive, breaking vllm-ascend's direct self.positions access and tests missing `draft_model_config.uses_mrope`. vllm-project/vllm#32048 2. Moved bs_to_padded_graph_size from CompilationConfig to CudagraphDispatcher due to the refactor vllm-project/vllm#30143 3. Remove unused `maybe_setup_kv_connector` due to vllm-project/vllm#32077 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21217728738/job/61043738834 6. ✅Upgrade vllm commit to: 0122(8ebf271) Updating FusedMoEParallelConfig (added enable_eplb) and FusedMoEConfig due to vllm-project/vllm#32414 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21249922546/job/61148613054 8. ✅Upgrade vllm commit to: 0123(dc917cc) Setting temperature=0.0 due to the removal of the default temperature value in vllm-project/vllm#32723 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21280796875 ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.14.0 - vLLM main: vllm-project/vllm@d682094 --------- Signed-off-by: wjunLu <wjunlu217@gmail.com> Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com> Co-authored-by: wjunLu <wjunlu217@gmail.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

### What this PR does / why we need it? 1. ✅ Upgrade vllm commit to: 0115 (8471b27) Modify import paths due to the refactors： vllm-project/vllm#32245 vllm-project/vllm#32060 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21034239336/job/60490156965?pr=5913 2. ✅Upgrade vllm commit to: 0119 (9a1f16d) Fix `WorkerProc.__init__() missing 1 required positional argument: 'is_driver_worker'` due to vllm-project/vllm#28506 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21156263050/job/60841668755?5569 3. ✅Upgrade vllm commit to: 0120(148117e) 1. Add `skip_compiled` param in `set_forward_context` due to vllm-project/vllm#30385 2. Modify `tests/ut/spec_decode/test_eagle_proposer.py` due to vllm-project/vllm#24322 change `self.max_num_tokens = vllm_config.scheduler_config.max_num_batched_tokens + max_batch_size` 3. Modify UT import paths due to the refactors：vllm-project/vllm#32060 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21204851770/job/60999046946 4. ✅Upgrade vllm commit to: 0121(f23fb5a) 1. vLLM switched `uses_mrope` from target to draft model config, making `positions`/`mrope_positions` mutually exclusive, breaking vllm-ascend's direct self.positions access and tests missing `draft_model_config.uses_mrope`. vllm-project/vllm#32048 2. Moved bs_to_padded_graph_size from CompilationConfig to CudagraphDispatcher due to the refactor vllm-project/vllm#30143 3. Remove unused `maybe_setup_kv_connector` due to vllm-project/vllm#32077 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21217728738/job/61043738834 6. ✅Upgrade vllm commit to: 0122(8ebf271) Updating FusedMoEParallelConfig (added enable_eplb) and FusedMoEConfig due to vllm-project/vllm#32414 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21249922546/job/61148613054 8. ✅Upgrade vllm commit to: 0123(dc917cc) Setting temperature=0.0 due to the removal of the default temperature value in vllm-project/vllm#32723 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21280796875 ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.14.0 - vLLM main: vllm-project/vllm@d682094 --------- Signed-off-by: wjunLu <wjunlu217@gmail.com> Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com> Co-authored-by: wjunLu <wjunlu217@gmail.com>

### What this PR does / why we need it? 1. ✅ Upgrade vllm commit to: 0115 (8471b27) Modify import paths due to the refactors： vllm-project/vllm#32245 vllm-project/vllm#32060 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21034239336/job/60490156965?pr=5913 2. ✅Upgrade vllm commit to: 0119 (9a1f16d) Fix `WorkerProc.__init__() missing 1 required positional argument: 'is_driver_worker'` due to vllm-project/vllm#28506 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21156263050/job/60841668755?5569 3. ✅Upgrade vllm commit to: 0120(148117e) 1. Add `skip_compiled` param in `set_forward_context` due to vllm-project/vllm#30385 2. Modify `tests/ut/spec_decode/test_eagle_proposer.py` due to vllm-project/vllm#24322 change `self.max_num_tokens = vllm_config.scheduler_config.max_num_batched_tokens + max_batch_size` 3. Modify UT import paths due to the refactors：vllm-project/vllm#32060 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21204851770/job/60999046946 4. ✅Upgrade vllm commit to: 0121(f23fb5a) 1. vLLM switched `uses_mrope` from target to draft model config, making `positions`/`mrope_positions` mutually exclusive, breaking vllm-ascend's direct self.positions access and tests missing `draft_model_config.uses_mrope`. vllm-project/vllm#32048 2. Moved bs_to_padded_graph_size from CompilationConfig to CudagraphDispatcher due to the refactor vllm-project/vllm#30143 3. Remove unused `maybe_setup_kv_connector` due to vllm-project/vllm#32077 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21217728738/job/61043738834 6. ✅Upgrade vllm commit to: 0122(8ebf271) Updating FusedMoEParallelConfig (added enable_eplb) and FusedMoEConfig due to vllm-project/vllm#32414 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21249922546/job/61148613054 8. ✅Upgrade vllm commit to: 0123(dc917cc) Setting temperature=0.0 due to the removal of the default temperature value in vllm-project/vllm#32723 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21280796875 ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.14.0 - vLLM main: vllm-project/vllm@d682094 --------- Signed-off-by: wjunLu <wjunlu217@gmail.com> Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com> Co-authored-by: wjunLu <wjunlu217@gmail.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

### What this PR does / why we need it? 1. ✅ Upgrade vllm commit to: 0115 (8471b27) Modify import paths due to the refactors： vllm-project/vllm#32245 vllm-project/vllm#32060 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21034239336/job/60490156965?pr=5913 2. ✅Upgrade vllm commit to: 0119 (9a1f16d) Fix `WorkerProc.__init__() missing 1 required positional argument: 'is_driver_worker'` due to vllm-project/vllm#28506 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21156263050/job/60841668755?5569 3. ✅Upgrade vllm commit to: 0120(148117e) 1. Add `skip_compiled` param in `set_forward_context` due to vllm-project/vllm#30385 2. Modify `tests/ut/spec_decode/test_eagle_proposer.py` due to vllm-project/vllm#24322 change `self.max_num_tokens = vllm_config.scheduler_config.max_num_batched_tokens + max_batch_size` 3. Modify UT import paths due to the refactors：vllm-project/vllm#32060 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21204851770/job/60999046946 4. ✅Upgrade vllm commit to: 0121(f23fb5a) 1. vLLM switched `uses_mrope` from target to draft model config, making `positions`/`mrope_positions` mutually exclusive, breaking vllm-ascend's direct self.positions access and tests missing `draft_model_config.uses_mrope`. vllm-project/vllm#32048 2. Moved bs_to_padded_graph_size from CompilationConfig to CudagraphDispatcher due to the refactor vllm-project/vllm#30143 3. Remove unused `maybe_setup_kv_connector` due to vllm-project/vllm#32077 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21217728738/job/61043738834 6. ✅Upgrade vllm commit to: 0122(8ebf271) Updating FusedMoEParallelConfig (added enable_eplb) and FusedMoEConfig due to vllm-project/vllm#32414 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21249922546/job/61148613054 8. ✅Upgrade vllm commit to: 0123(dc917cc) Setting temperature=0.0 due to the removal of the default temperature value in vllm-project/vllm#32723 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21280796875 ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.14.0 - vLLM main: vllm-project/vllm@d682094 --------- Signed-off-by: wjunLu <wjunlu217@gmail.com> Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com> Co-authored-by: wjunLu <wjunlu217@gmail.com>

…28506) Signed-off-by: Chenyaaang <chenyangli@google.com>

### What this PR does / why we need it? 1. ✅ Upgrade vllm commit to: 0115 (8471b27) Modify import paths due to the refactors： vllm-project/vllm#32245 vllm-project/vllm#32060 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21034239336/job/60490156965?pr=5913 2. ✅Upgrade vllm commit to: 0119 (9a1f16d) Fix `WorkerProc.__init__() missing 1 required positional argument: 'is_driver_worker'` due to vllm-project/vllm#28506 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21156263050/job/60841668755?5569 3. ✅Upgrade vllm commit to: 0120(148117e) 1. Add `skip_compiled` param in `set_forward_context` due to vllm-project/vllm#30385 2. Modify `tests/ut/spec_decode/test_eagle_proposer.py` due to vllm-project/vllm#24322 change `self.max_num_tokens = vllm_config.scheduler_config.max_num_batched_tokens + max_batch_size` 3. Modify UT import paths due to the refactors：vllm-project/vllm#32060 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21204851770/job/60999046946 4. ✅Upgrade vllm commit to: 0121(f23fb5a) 1. vLLM switched `uses_mrope` from target to draft model config, making `positions`/`mrope_positions` mutually exclusive, breaking vllm-ascend's direct self.positions access and tests missing `draft_model_config.uses_mrope`. vllm-project/vllm#32048 2. Moved bs_to_padded_graph_size from CompilationConfig to CudagraphDispatcher due to the refactor vllm-project/vllm#30143 3. Remove unused `maybe_setup_kv_connector` due to vllm-project/vllm#32077 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21217728738/job/61043738834 6. ✅Upgrade vllm commit to: 0122(8ebf271) Updating FusedMoEParallelConfig (added enable_eplb) and FusedMoEConfig due to vllm-project/vllm#32414 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21249922546/job/61148613054 8. ✅Upgrade vllm commit to: 0123(dc917cc) Setting temperature=0.0 due to the removal of the default temperature value in vllm-project/vllm#32723 Test result: https://github.com/vllm-project/vllm-ascend/actions/runs/21280796875 ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.14.0 - vLLM main: vllm-project/vllm@d682094 --------- Signed-off-by: wjunLu <wjunlu217@gmail.com> Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com> Co-authored-by: wjunLu <wjunlu217@gmail.com> Signed-off-by: nanxing <1014662416@qq.com>

…28506) Signed-off-by: Chenyaaang <chenyangli@google.com>

mergify Bot added the v1 label Nov 12, 2025

Chenyaaang mentioned this pull request Nov 12, 2025

Enable Pipeline Parallelism on Ray vllm-project/tpu-inference#1078

Merged

8 tasks

Chenyaaang force-pushed the pp-vllm branch from 9d5b93e to e83bc4c Compare November 12, 2025 02:26

Chenyaaang force-pushed the pp-vllm branch from e83bc4c to 1ba301a Compare November 20, 2025 01:45

Chenyaaang force-pushed the pp-vllm branch 2 times, most recently from 0af58b8 to 9c427be Compare December 16, 2025 00:02

Chenyaaang changed the title ~~Enable PP on tpu_inference~~ Enable Pipeline Parallelism on TPU backend Dec 16, 2025

Chenyaaang marked this pull request as ready for review December 16, 2025 19:53

Chenyaaang changed the title ~~Enable Pipeline Parallelism on TPU backend~~ [TPU][Core] Enable Pipeline Parallelism on TPU backend Dec 16, 2025

Chenyaaang force-pushed the pp-vllm branch from 9c427be to 7c54857 Compare December 16, 2025 20:50

mrjunwan-lang reviewed Dec 17, 2025

View reviewed changes

Comment thread vllm/v1/executor/multiproc_executor.py Outdated

yaochengji reviewed Dec 17, 2025

View reviewed changes

Comment thread vllm/v1/executor/multiproc_executor.py Outdated

Comment thread vllm/v1/executor/multiproc_executor.py Outdated

Comment thread vllm/v1/executor/multiproc_executor.py Outdated

mergify Bot added the needs-rebase label Jan 6, 2026

Chenyaaang force-pushed the pp-vllm branch 2 times, most recently from ab54c87 to 4b78247 Compare January 6, 2026 19:33

mergify Bot removed the needs-rebase label Jan 6, 2026

Chenyaaang mentioned this pull request Jan 6, 2026

Override vLLM classes for PP vllm-project/tpu-inference#1401

Merged

yaochengji added the ready ONLY add when PR is ready to merge/full CI is needed label Jan 13, 2026

Chenyaaang added 7 commits January 14, 2026 05:25

PP on Ray

44c0466

Signed-off-by: Chenyaaang <chenyangli@google.com> amend ray Signed-off-by: Chenyaaang <chenyangli@google.com>

remove unncessary comments

41c9920

Signed-off-by: Chenyaaang <chenyangli@google.com>

move jax's get_pp_group into parallel_state instead of init to avoid …

35e8001

…circular import Signed-off-by: Chenyaaang <chenyangli@google.com>

abstract hardware related methods in multiproc executor

3312fe4

Signed-off-by: Chenyaaang <chenyangli@google.com>

pass is_driver_worker from the executor so that other hardware can ov…

a80f979

…erride this method Signed-off-by: Chenyaaang <chenyangli@google.com>

extract methods for pp ray

70c3eda

Signed-off-by: Chenyaaang <chenyangli@google.com>

revert change in parallel_state

f24f1e7

Signed-off-by: Chenyaaang <chenyangli@google.com>

Chenyaaang force-pushed the pp-vllm branch from 8b22478 to f24f1e7 Compare January 14, 2026 05:31

yaochengji approved these changes Jan 16, 2026

View reviewed changes

yaochengji merged commit 484e22b into vllm-project:main Jan 16, 2026
45 checks passed

Meihan-chen mentioned this pull request Jan 19, 2026

[Main2Main] Upgrade vllm commit to 0122 vllm-project/vllm-ascend#5985

Closed

Meihan-chen mentioned this pull request Jan 21, 2026

[Main2Main] Upgrade vllm commit to 0120 vllm-project/vllm-ascend#6040

Closed

Meihan-chen mentioned this pull request Jan 26, 2026

[Main2Main] Upgrade vllm commit to 0123 vllm-project/vllm-ascend#6169

Merged

mystous pushed a commit to mystous/vllm_hybrid that referenced this pull request May 10, 2026

[TPU][Core] Enable Pipeline Parallelism on TPU backend (vllm-project#…

d2e5fad

…28506) Signed-off-by: Chenyaaang <chenyangli@google.com>

my-other-github-account pushed a commit to my-other-github-account/vllm that referenced this pull request May 15, 2026

[TPU][Core] Enable Pipeline Parallelism on TPU backend (vllm-project#…

9ff32f8

…28506) Signed-off-by: Chenyaaang <chenyangli@google.com>

my-other-github-account pushed a commit to my-other-github-account/vllm that referenced this pull request May 15, 2026

[TPU][Core] Enable Pipeline Parallelism on TPU backend (vllm-project#…

d3c30e3

…28506) Signed-off-by: Chenyaaang <chenyangli@google.com>

0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request May 19, 2026

[TPU][Core] Enable Pipeline Parallelism on TPU backend (vllm-project#…

9b3f6f5

…28506) Signed-off-by: Chenyaaang <chenyangli@google.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[TPU][Core] Enable Pipeline Parallelism on TPU backend#28506

[TPU][Core] Enable Pipeline Parallelism on TPU backend#28506
yaochengji merged 8 commits into
vllm-project:mainfrom
Chenyaaang:pp-vllm

Chenyaaang commented Nov 12, 2025 •

edited by github-actions Bot

Loading

Uh oh!

chatgpt-codex-connector Bot commented Dec 16, 2025

Uh oh!

Uh oh!

yaochengji left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mergify Bot commented Jan 6, 2026

Uh oh!

Chenyaaang commented Jan 6, 2026

Uh oh!

yaochengji commented Jan 13, 2026

Uh oh!

yaochengji commented Jan 13, 2026

Uh oh!

yaochengji left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

Chenyaaang commented Nov 12, 2025 • edited by github-actions Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chatgpt-codex-connector Bot commented Dec 16, 2025

Uh oh!

Uh oh!

yaochengji left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mergify Bot commented Jan 6, 2026

Uh oh!

Chenyaaang commented Jan 6, 2026

Uh oh!

yaochengji commented Jan 13, 2026

Uh oh!

yaochengji commented Jan 13, 2026

Uh oh!

yaochengji left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Chenyaaang commented Nov 12, 2025 •

edited by github-actions Bot

Loading