Extract code blocks only after Code marker #1223

albertvillanova · 2025-04-18T13:32:36Z

Extract code blocks only after Code marker.

albertvillanova · 2025-04-18T13:34:34Z

src/smolagents/utils.py

@@ -187,6 +187,7 @@ def parse_code_blobs(text: str) -> str:
        ValueError: If no valid code block is found in the text.
    """
    pattern = r"```(?:py|python)?\s*\n(.*?)\n```"
+    text = text.split("Code:")[-1]


This change assumes that only one "Code:" marker appears in the model output.

@aymeric-roucher do you think this is a sensible assumption? Alternative assumptions?

aymeric-roucher · 2025-04-18T13:43:04Z

I think it's more reasonable to assume that the model will often forget to put the "Code:" header. To solve #1219, it would be more adapted IMO to just enforce in the regex "header is py or python" rather than "Code: has been generated before the curent code blob"

albertvillanova · 2025-04-18T13:56:11Z

This PR does not enforce the presence of the "Code:" marker: it can handle model output with or without "Code:" marker.

See test case: https://github.com/huggingface/smolagents/pull/1223/files#diff-33c13e0b177bacd2f02e29bcb8aea5b49e7ce34901fd8f41fefb65defba1bd33R164

Additionally, note that the word "py" or "python" after the triple backtick is always optional.

My question above was:

Can we assume that the model only outputs one-or-zero "Code:" marker? This is covered in this PR.
Or could it output multiple "Code:" markers? This is not covered in this PR.

aymeric-roucher · 2025-04-22T08:34:43Z

To clarify: I know that currently the header py or python is optional (cf the regex).
The goal here is to differentiate markdown code blobs that are not the agent's action (could be for instance the LLM generating a plan in pure markdown and separating it from the rest of its generation with triple backticks) from the code blobs that are the agent's real action in python.

Sometimes LLM generate their action in 2 parts, and forget to put the "Code:" header

This is why, I think it makes more sense to enforce "action code blobs have a mandatory header py or python" (which is not currently the case) than to enforce "action code blobs have the 'Code:' sequence before them" as this PR is currently proposing.

albertvillanova · 2025-04-22T08:53:26Z

Thanks for the clarification, @aymeric-roucher.

Just a naive question: do you think it could be plausible that the model might generate a py/python code block within the "Thought" section (e.g., as part of reasoning or planning), which should not be parsed as an action code block?

If so, maybe a combined approach would be more solid...

Curious to hear your thoughts on that edge case.

aymeric-roucher · 2025-04-23T09:06:29Z

@albertvillanova it is possible indeed! But in terms of reducing false positives / false negatives, I think the solution "force the heading with py or python" could have some false positives, but it will greatly reduce false negatives compared to the solution "Take code blobs only after the Code: sequence"

albertvillanova added 2 commits April 18, 2025 15:30

Test parse_code_blobs with Thought/Code

2cb6088

Extract code blocks only after Code: marker

d08b978

albertvillanova commented Apr 18, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Extract code blocks only after Code marker #1223

Extract code blocks only after Code marker #1223

Uh oh!

albertvillanova commented Apr 18, 2025

Uh oh!

albertvillanova Apr 18, 2025 •

edited

Loading

Uh oh!

aymeric-roucher commented Apr 18, 2025

Uh oh!

albertvillanova commented Apr 18, 2025 •

edited

Loading

Uh oh!

aymeric-roucher commented Apr 22, 2025

Uh oh!

albertvillanova commented Apr 22, 2025 •

edited

Loading

Uh oh!

aymeric-roucher commented Apr 23, 2025

Uh oh!

Uh oh!

Extract code blocks only after Code marker #1223

Are you sure you want to change the base?

Extract code blocks only after Code marker #1223

Uh oh!

Conversation

albertvillanova commented Apr 18, 2025

Uh oh!

albertvillanova Apr 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

aymeric-roucher commented Apr 18, 2025

Uh oh!

albertvillanova commented Apr 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aymeric-roucher commented Apr 22, 2025

Uh oh!

albertvillanova commented Apr 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aymeric-roucher commented Apr 23, 2025

Uh oh!

Uh oh!

albertvillanova Apr 18, 2025 •

edited

Loading

albertvillanova commented Apr 18, 2025 •

edited

Loading

albertvillanova commented Apr 22, 2025 •

edited

Loading