Skip to content

Commit b5a317e

Browse files
committed
Remove source compilation of nixl dependency
1 parent a5b84f1 commit b5a317e

File tree

4 files changed

+4
-117
lines changed

4 files changed

+4
-117
lines changed

docker/Dockerfile

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -441,11 +441,9 @@ RUN --mount=type=cache,target=/root/.cache/uv \
441441

442442
# Install EP kernels(pplx-kernels and DeepEP), NixL
443443
COPY tools/ep_kernels/install_python_libraries.sh install_python_libraries.sh
444-
COPY tools/install_nixl.sh install_nixl.sh
445444
ENV CUDA_HOME=/usr/local/cuda
446445
RUN export TORCH_CUDA_ARCH_LIST="${TORCH_CUDA_ARCH_LIST:-9.0a+PTX}" \
447-
&& bash install_python_libraries.sh \
448-
&& bash install_nixl.sh --force
446+
&& bash install_python_libraries.sh
449447

450448
#################### vLLM installation IMAGE ####################
451449

docs/serving/expert_parallel_deployment.md

Lines changed: 2 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,6 @@ Before using EP, you need to install the necessary dependencies. We are actively
1010

1111
1. **Install DeepEP and pplx-kernels**: Set up host environment following vLLM's guide for EP kernels [here](gh-file:tools/ep_kernels).
1212
2. **Install DeepGEMM library**: Follow the [official instructions](https://github.com/deepseek-ai/DeepGEMM#installation).
13-
3. **For disaggregated serving**: Install UCX and NIXL following the [script](gh-file:tools/install_nixl.sh).
1413

1514
### Backend Selection Guide
1615

@@ -191,11 +190,9 @@ For production deployments requiring strict SLA guarantees for time-to-first-tok
191190

192191
### Setup Steps
193192

194-
1. **Install KV Connector**: Install NIXL using the [installation script](gh-file:tools/install_nixl.sh)
193+
1. **Configure Both Instances**: Add this flag to both prefill and decode instances `--kv-transfer-config '{"kv_connector":"NixlConnector","kv_role":"kv_both"}`
195194

196-
2. **Configure Both Instances**: Add this flag to both prefill and decode instances `--kv-transfer-config '{"kv_connector":"NixlConnector","kv_role":"kv_both"}`
197-
198-
3. **Client Orchestration**: Use the client-side script below to coordinate prefill/decode operations. We are actively working on routing solutions.
195+
2. **Client Orchestration**: Use the client-side script below to coordinate prefill/decode operations. We are actively working on routing solutions.
199196

200197
### Client Orchestration Example
201198

requirements/common.txt

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -49,3 +49,4 @@ pybase64 # fast base64 implementation
4949
cbor2 # Required for cross-language serialization of hashable objects
5050
setproctitle # Used to set process names for better debugging and monitoring
5151
openai-harmony >= 0.0.3 # Required for gpt-oss
52+
nixl >= 0.5.1 # Required for disaggregated prefill

tools/install_nixl.sh

Lines changed: 0 additions & 109 deletions
This file was deleted.

0 commit comments

Comments
 (0)