
[Frontend] Vendor exported templates to vllm.tools #18094


Open · wants to merge 4 commits into base: main

Conversation

aarnphm
Collaborator

@aarnphm aarnphm commented May 13, 2025

This PR vendors all exported tool-calling / chat templates from examples into vllm/tools, so end users no longer have to clone the repo to use the templates.

The format is as follows:

vllm serve <model> --chat-template tool_chat_template_hermes

Previous functionality is still preserved if users want to use a custom template path.
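For illustration, the lookup described above (vendored name first, custom path as a fallback) could be sketched like this; the function name and directory argument are assumptions for the sketch, not the PR's actual implementation:

```python
from pathlib import Path


def resolve_chat_template(value: str, vendored_dir: Path) -> Path:
    """Resolve a --chat-template value: try a vendored template name
    first, then fall back to treating the value as a filesystem path."""
    vendored = vendored_dir / f"{value}.jinja"
    if vendored.is_file():
        return vendored
    # Preserve previous behaviour: the value may be a custom template path.
    custom = Path(value)
    if custom.is_file():
        return custom
    raise FileNotFoundError(f"unknown chat template: {value!r}")
```

With this scheme, `vllm serve <model> --chat-template tool_chat_template_hermes` resolves against the vendored directory, while an explicit path keeps working unchanged.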

Signed-off-by: Aaron Pham [email protected]

@aarnphm aarnphm requested a review from DarkLight1337 May 13, 2025 18:45

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, covering a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run full CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

@aarnphm aarnphm requested review from russellb and mgoin May 13, 2025 18:45
@mergify mergify bot added documentation Improvements or additions to documentation frontend labels May 13, 2025
@aarnphm aarnphm added this to the v0.9.0 milestone May 13, 2025
Signed-off-by: Aaron Pham <[email protected]>
@DarkLight1337
Member

I have actually moved a bunch of chat templates to vllm/transformers_utils/chat_templates recently, perhaps we can keep using that directory?

@aarnphm
Collaborator Author

aarnphm commented May 14, 2025

I plan to move some of the tools items here as well, and chat_templates should be one of them imo

@aarnphm
Collaborator Author

aarnphm commented May 14, 2025

I guess the purpose of vllm/transformers_utils/chat_templates is to automatically apply chat templates based on model type?

I would prefer to set this explicitly rather than have it applied automatically, because that could lead to unexpected behaviour imo

@DarkLight1337
Member

I guess the purpose of vllm/transformers_utils/chat_templates is to automatically apply chat templates based on model type?

I would prefer to set this explicitly rather than have it applied automatically, because that could lead to unexpected behaviour imo

Only registry.py is responsible for doing that. Not all of the chat templates in that directory have to be applied automatically.

@aarnphm
Collaborator Author

aarnphm commented May 14, 2025

so how would we use this feature?

vllm serve <model> --chat-template deepseek_vl_2

@russellb
Member

Looking at your example in the PR description:

vllm serve <model> --chat-template tool_chat_template_hermes

What would you think about cleaning up the UX by adjusting the naming scheme where possible?

In this case, it would be really nice if the command was --chat-template hermes, for example.

It would also be nice to have a way to list the available templates, though that can be a different PR.

@aarnphm
Collaborator Author

aarnphm commented May 14, 2025

Looking at your example in the PR description:

vllm serve <model> --chat-template tool_chat_template_hermes

What would you think about cleaning up the UX by adjusting the naming scheme where possible?

Yeah I think we can clean up the naming here. Initially I was thinking (very much inspired by nix):

--chat-template hermes # alpaca | deepseek_r1 | etc.
--chat-template tool/hermes # with tool_call for hermes, tool/deepseek
--chat-template hf:org/new_model#tool_call_template.jinja <-- this can live in the HF model repo
--chat-template github:vllm-project/vllm/main#path/to/template.jinja <-- TODO, maybe

By default, all of the templates known to us would be the "officially" supported ones. Model makers could then also host their own, to reduce the maintenance burden on our side.
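The nix-style specifiers sketched above could be parsed along these lines; the scheme names come from the comment, but the parser itself is a hypothetical sketch, not an agreed-upon design:

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TemplateSpec:
    scheme: str           # "builtin", "tool", "hf", or "github"
    repo: Optional[str]   # org/model (hf:) or org/repo/ref (github:)
    name: str             # template name, or file path within the repo


def parse_template_spec(value: str) -> TemplateSpec:
    """Parse specifiers like 'hermes', 'tool/hermes',
    'hf:org/new_model#tool_call_template.jinja',
    'github:vllm-project/vllm/main#path/to/template.jinja'."""
    for scheme in ("hf", "github"):
        if value.startswith(scheme + ":"):
            repo, _, path = value[len(scheme) + 1:].partition("#")
            return TemplateSpec(scheme, repo, path)
    if value.startswith("tool/"):
        return TemplateSpec("tool", None, value[len("tool/"):])
    return TemplateSpec("builtin", None, value)
```

One nice property of a scheme prefix like hf: or github: is that it keeps bare names unambiguous, so plain `--chat-template hermes` never collides with a remote source.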

It would also be nice to have a way to list the available templates, though that can be a different PR.

This is actually included in the CLI description, though I think I can also add it to the EngineArgs docstring so it shows up in the docs.

@DarkLight1337
Member

DarkLight1337 commented May 15, 2025

so how would we use this feature?

vllm serve <model> --chat-template deepseek_vl_2

registry.py currently only does #17805. It is only applied when the user doesn't provide a chat template and no chat template is available from the model.
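The resolution order described here (explicit user template first, registry default only as a last resort) might look roughly like this; the function and argument names are illustrative, not the actual registry.py API:

```python
from typing import Optional


def pick_chat_template(
    user_template: Optional[str],
    model_template: Optional[str],
    registry_default: Optional[str],
) -> Optional[str]:
    """Resolution order as described above: an explicit user template
    always wins; the model's own template is used next; the registry's
    model-type default is only applied when neither is available."""
    if user_template is not None:
        return user_template
    if model_template is not None:
        return model_template
    return registry_default
```

This keeps the auto-apply behaviour strictly a fallback, which matches the concern about automatic application leading to unexpected behaviour.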

@aarnphm aarnphm removed this from the v0.9.0 milestone May 15, 2025
@chaunceyjiang
Contributor

Yeah I think we can cleanup the naming here. Initially I was thinking (very much inspired by nix):

--chat-template hermes # alpaca | deepseek_r1 | etc.
--chat-template tool/hermes # with tool_call for hermes, tool/deepseek
--chat-template hf:org/new_model#tool_call_template.jinja <-- this can live in the HF model repo
--chat-template github:vllm-project/vllm/main#path/to/template.jinja <-- TODO, maybe

Perhaps this is also a solution:
--chat-template https://xxxxx.template_vlm2vec.jinja
which would allow using a file from the web.
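Fetching a template from a URL as suggested above could be done with the standard library alone; this is only a sketch of the idea under assumed names (the cache directory and function are not vLLM's implementation):

```python
import tempfile
import urllib.parse
import urllib.request
from pathlib import Path
from typing import Optional


def fetch_remote_template(url: str, cache_dir: Optional[Path] = None) -> Path:
    """Download a chat template from a URL into a local cache directory
    and return the local path, so the rest of the loader can treat
    remote and local templates uniformly."""
    cache_dir = cache_dir or Path(tempfile.gettempdir()) / "chat_template_cache"
    cache_dir.mkdir(parents=True, exist_ok=True)
    # Derive a local filename from the URL path component.
    name = Path(urllib.parse.urlparse(url).path).name or "template.jinja"
    dest = cache_dir / name
    with urllib.request.urlopen(url) as resp:
        dest.write_bytes(resp.read())
    return dest
```

Caching to a deterministic local path also means the downloaded file can be handed to the existing custom-template-path code path without further changes.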

Labels
documentation Improvements or additions to documentation frontend tool-calling