Skip to content

[BFCL] Add gemini-2.5-pro to the Leaderboard #974

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 4 commits into from
Apr 8, 2025

Conversation

catherineruoxiwu
Copy link
Contributor

Add the following new models to the leaderboard:

  • gemini-2.5-pro-exp-03-25-FC
  • gemini-2.5-pro-exp-03-25

Note: reasoning chain is now available on gemini's online playground (Freeform) but is not included in the API response at this moment.

@HuanzhiMao HuanzhiMao added the BFCL-New Model Add New Model to BFCL label Apr 7, 2025
Copy link
Collaborator

@HuanzhiMao HuanzhiMao left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Thanks for the PR @catherineruoxiwu !

@HuanzhiMao HuanzhiMao merged commit b6d3eef into ShishirPatil:main Apr 8, 2025
@catherineruoxiwu catherineruoxiwu deleted the add-gemini-2.5 branch April 9, 2025 02:23
HuanzhiMao added a commit that referenced this pull request Apr 14, 2025
This PR updates the leaderboard to reflect the change in score due to
the following PR merge:

1. #960 
2. #962 
3. #959 
4. #963 
5. #974 
6. #979 
7. #972 
8. #981 
9. #980 
10. #943 

Models were evaluated using checkpoint commit 9108a65.

Additionally, as mentioned in #943, the score for executable categories
(`rest`, `exec_simple`, `exec_multiple`, `exec_parallel`,
`exec_multiple_parallel`) will no longer be reported on the leaderboard.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
BFCL-New Model Add New Model to BFCL
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants