Skip to content

[Related Issue: #618] Add GSoC 2025 Idea Proposal for AI API Eval #673

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Mar 21, 2025

Conversation

nb923
Copy link
Contributor

@nb923 nb923 commented Mar 17, 2025

Description:
This PR adds an initial idea draft for the AI API Eval Project. This project is to develop a Dart-centered evaluation framework designed to simplify the testing of generative AI models across multiple types (text, image, code). This will be done by integrating evaluation toolkits: llm-harness for text, torch-fidelity and CLIP for images, and HumanEval/MBPP with CodeBLEU for code.

Related Issue:
#618

Feedback:
Looking for feedback on the design, architecture, and features provided in the idea draft. Feedback will be greatly appreciated.

@nb923
Copy link
Contributor Author

nb923 commented Mar 17, 2025

@ashitaprasad looking for feedback on the idea draft, thanks!

@ashitaprasad
Copy link
Member

@nb923 Provide some designs for the feature in the issue and send across a draft PR implementing some features.

@ashitaprasad ashitaprasad merged commit a7237bf into foss42:main Mar 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants