[Bug] Fix gpt-oss missing tool content #24954
Conversation
Code Review
This pull request fixes a bug in gpt-oss model tool calls by adding 'analysis' content and with_recipient. The changes are logical and align with the stated purpose. However, I've identified a high-severity issue: the handling of 'analysis' content is not robust for multimodal inputs, which could lead to a runtime crash. I have provided a code suggestion to address this.
vllm/entrypoints/harmony_utils.py
The logic to extract content is not robust. The content of an assistant message can be a list of parts (e.g., for multimodal inputs), not just a string. The current implementation, content = chat_msg.get("content") or "", will cause a runtime error if content is a non-empty list, because Message.from_role_and_content expects a string. The code should handle the case where content is a list by extracting the text parts, similar to how it's handled for other message types in this file.
content = chat_msg.get("content")
if isinstance(content, list):
# Extract text from multimodal content
content = "\n".join(
p.get("text", "") for p in content
if isinstance(p, dict) and p.get("type") == "text")
elif not isinstance(content, str):
content = ""
analysis_msg = Message.from_role_and_content(Role.ASSISTANT, content)
analysis_msg = analysis_msg.with_channel("analysis")
msgs.append(analysis_msg)
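For illustration, the extraction logic from the suggestion above can be exercised as a standalone helper (the function name here is hypothetical; the real patch performs this inline in harmony_utils.py):

```python
def extract_text_content(content):
    """Normalize an OpenAI-style message `content` field to a plain string.

    Assistant message content may be a string, a list of multimodal
    parts, or None; only the {"type": "text"} parts are kept.
    """
    if isinstance(content, list):
        # Join the text of all {"type": "text", ...} parts.
        return "\n".join(
            p.get("text", "") for p in content
            if isinstance(p, dict) and p.get("type") == "text")
    if isinstance(content, str):
        return content
    return ""  # None or unexpected types fall back to an empty string


# String content passes through unchanged.
print(extract_text_content("thinking about the tool call"))
# Multimodal list content keeps only the text parts.
print(extract_text_content([
    {"type": "text", "text": "step 1"},
    {"type": "image_url", "image_url": {"url": "http://example/img.png"}},
    {"type": "text", "text": "step 2"},
]))
# None (missing content) becomes the empty string.
print(repr(extract_text_content(None)))
```

This is what prevents the crash: a list never reaches Message.from_role_and_content as-is.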
Force-pushed from c3780bf to eed8815
@alecsolder
The changes include adding 'with_recipient' and the Assistant's 'analysis' content. Without adding this content, there was an issue where the gpt-oss model had a higher probability of outputting abnormal tokens when calling tools. Signed-off-by: kyt <[email protected]>
Force-pushed from eed8815 to 39d6e8e
In your test script, I see you're using the streaming completions endpoint, which I don't think uses the harmony_utils method you modified? I just want to double-check I'm reading it right.
Also, thanks for mentioning that huggingface tokenizer change, I hadn't seen it and my snapshot was out of date!
Thank you for checking. I've double-checked the part you mentioned.
Purpose
This PR requires the following PRs to be merged first: #24768 and the harmony lib PR (openai/harmony#76).
The changes include adding 'with_recipient' and the Assistant's 'analysis' content.
Without adding this content, there was an issue where the gpt-oss model had a higher probability of outputting abnormal tokens when calling tools.
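As a sketch of the pattern described above, the assistant's prior reasoning is re-emitted on the 'analysis' channel and each tool call is addressed to a recipient. Plain dicts stand in for the openai_harmony objects here (the real code uses Message.from_role_and_content, with_channel, and with_recipient), so this is illustrative only:

```python
def expand_assistant_tool_call(chat_msg):
    """Illustrative sketch: expand an assistant tool-call chat message
    into harmony-style messages. Dicts stand in for openai_harmony's
    Message objects; channel/recipient conventions follow the harmony
    format for gpt-oss."""
    msgs = []
    content = chat_msg.get("content") or ""
    if content:
        # Re-emit the assistant's reasoning on the 'analysis' channel so
        # the model sees its prior chain-of-thought before the tool call.
        msgs.append({"role": "assistant", "channel": "analysis",
                     "content": content})
    for call in chat_msg.get("tool_calls", []):
        fn = call["function"]
        # The tool call goes on the 'commentary' channel, addressed to
        # the tool via a recipient such as "functions.get_weather".
        msgs.append({"role": "assistant", "channel": "commentary",
                     "recipient": f"functions.{fn['name']}",
                     "content": fn["arguments"]})
    return msgs


example = {
    "role": "assistant",
    "content": "I should look up the weather first.",
    "tool_calls": [{"function": {"name": "get_weather",
                                 "arguments": '{"city": "Seoul"}'}}],
}
for m in expand_assistant_tool_call(example):
    print(m)
```

Without the 'analysis' message, the rendered prompt omits the reasoning that preceded the tool call, which is the gap this PR closes.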
Test Plan
gpt-oss_test.py
messages.txt
Run python3 gpt-oss_test.py about 10 times.
Test Result
(before) [screenshot]
(after applying the harmony lib changes) [screenshot]