-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add ThinkAgents/ThinkAgent-1B #928
base: main
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
When i used the default one, the generations were messy, so I changed it to be the same formatting function i used for finetuning the model. |
In that case, if you’re using a custom chat template that provides better generation results, please document it in the model card. This way, users will know exactly how to replicate your function-calling setup, and we’ll benchmark the model using your recommended approach so the score accurately reflects the typical user experience. |
I have updated the model tokenizer to use the correct chat template used in the _format_prompt function. please review it and let me know if further updates are needed. |
I think the chat template and
but in your `_format_prompt), you only have this:
Notice how the |
@0xayman Would you be fine if I directly made modifications to your branch? I can also raise a PR to your branch instead. Either way is fine with me. |
Yes it's fine, you can go ahead with and make and required modifications. |
Are you doing the |
Yes I'm passing the tools in the user message instead of the system prompt. I found this to work better |
I got the following for data_overall.csv. Does this align with what you obtained?
|
Can you please share the csv file so I can check it ? |
It is close to what I get for |
My latest evaluation records: |
I've updated the handler and attached the latest evaluation results. |
I generated another run, and attached the fully formatted prompt before it hit the completion endpoint for test case id If everything looks good to you, I’ll go ahead and merge the PR and update the leaderboard with your model’s score!
|
Yes the default system prompt should not be included. Here is the formatting function I was usnig initially:
Can you please tell me if there are any particular reasons we can't use it? |
As explained here, the issue with your formatting function is that, it is not aligned with what the chat template from the model card on huggingface is suggesting. |
I think I misunderstood the last message. but,, where is this part of the prompt is coming from:
If I'm not mistaken, the formatting function in the current version of the code uses the following logic:
It cutsoff the default system prompt then append my custom system prompt. |
The default system prompt is not cut off. It's still included through these two lines.
|
Please review the latest commit I've made. |
@HuanzhiMao Just a reminder to check if everthing is going good. |
Regarding your last commit, 7f1f62f, it makes sense to not include the default system prompt here. However, these changes don't make sense. They have nothing to do with system prompt, and you are not following your own chat template. For example, for this part of the chat template on function doc format, it does not translate to just
|
I've fixed the chat template and made a new commit. |
@HuanzhiMao Any updates so far? |
I believe you haven't addressed my above concern in your new commit. |
The part you've mentioned is no longer part of the chat_template, it has been replaced with this code In the last commit. Also the model's chat_template has been updated in huggingface. |
What about these new-added |
Are you sure you are viewing the latest commit ? https://github.com/0xayman/gorilla/tree/382b4957f60a3245c37a5446a2a96cb758e645f6 |
Yes. For example, the |
I'm not sure if it will affect the results, but I've removed them to be consistent. |
Add ThinkAgents/ThinkAgent-1B model and model handler