diff --git a/examples/gpt-5/prompt-optimization-cookbook.ipynb b/examples/gpt-5/prompt-optimization-cookbook.ipynb index d423fb9fc2..a692280b16 100644 --- a/examples/gpt-5/prompt-optimization-cookbook.ipynb +++ b/examples/gpt-5/prompt-optimization-cookbook.ipynb @@ -953,7 +953,7 @@ "id": "0a84939c", "metadata": {}, "source": [ - "GPT-5-mini crushes this task, so even the baseline prompt gets scores of >= 4 almost all of the time. However if we compare the percent of perfect scores (6/6) for the judge, we see that the optimize prompt has way significantly more perfect answers when evaluated in the two categories of FailSafeQA answer quality: robustness and context grounding." + "GPT-5-mini crushes this task, so even the baseline prompt gets scores of >= 4 almost all of the time. However, if we compare the percent of perfect scores (6/6) for the judge, we see that the optimized prompt has way more perfect answers when evaluated in the two categories of FailSafeQA answer quality: robustness and context grounding." ] }, {