
@tmuttaki commented Sep 12, 2025

SUMMARY:

When running vLLM with microsoft/Phi-3-medium-4k-instruct, we get the following error:

Value error, User-specified max_model_len (8192) is greater than the derived max_model_len (max_position_embeddings=4096 or model_max_length=None in model's config.json). 
This may lead to incorrect model outputs or CUDA errors. 
To allow overriding this maximum, set the env var VLLM_ALLOW_LONG_MAX_MODEL_LEN=1 [type=value_error, input_value=ArgsKwargs((), {'model': ...gits_processors': None}), input_type=ArgsKwargs]

This happens because no model-specific performance config exists, so max_model_len is taken from the shared fallback model-configs/common/performance/server.yml:

WARNING  neuralmagic.utils:utils.py:104 No performance server config found for microsoft/Phi-3-medium-4k-instruct, using fallback=PosixPath('model-configs/common/performance/server.yml')
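
For context, the shared fallback evidently sets a context length larger than this model supports. A minimal sketch of the relevant part of such a fallback, assuming a flat YAML layout with a max_model_len key (the 8192 value matches the error above; the exact keys in the real server.yml are an assumption):

```yaml
# model-configs/common/performance/server.yml (hypothetical sketch)
# A generic default sized for longer-context models; Phi-3-medium-4k
# only supports max_position_embeddings=4096, so this value fails
# vLLM's max_model_len validation for that model.
max_model_len: 8192
```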

Example run: https://github.com/neuralmagic/nm-cicd/actions/runs/17684609008

TEST PLAN:

Adding a model-specific performance config solves the issue; a sketch of the shape such a config takes follows the link below.
Example run: https://github.com/neuralmagic/nm-cicd/actions/runs/17684808798
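
For reference, a minimal sketch of what the added per-model override might look like. The path and key names are assumptions modeled on the fallback above; the 4096 cap comes from the model's max_position_embeddings in config.json:

```yaml
# model-configs/microsoft/Phi-3-medium-4k-instruct/performance/server.yml
# Hypothetical path and keys; caps the serving context at the model's
# trained limit so vLLM's max_model_len check passes.
max_model_len: 4096
```

Setting VLLM_ALLOW_LONG_MAX_MODEL_LEN=1, as the error message suggests, would also unblock the run, but it only silences the check; requests beyond 4096 tokens could still hit the incorrect outputs or CUDA errors the warning describes, so capping max_model_len is the safer fix.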

@derekk-nm

@tmuttaki, I'm curious why we're testing with this model. It's not listed in the Model Validation tracker.

@tarukumar

@derekk-nm This model is part of Accept Sync, and since we were observing issues with Accept Sync on this model, we wanted to include it in our validation. I see no harm in adding the config to the repo. Let us know if you feel otherwise.

@derekk-nm

OK, makes sense. No harm in adding it to the repo. Be wary of the additional time it adds to Accept Sync, though.

@tarukumar merged commit 847e270 into main on Sep 15, 2025