foss42 · GANGSTER0910 · Mar 20, 2025 · Mar 26, 2025 · Mar 26, 2025 · Mar 26, 2025
diff --git a/doc/proposals/2025/gsoc/images/dashboard 2.png b/doc/proposals/2025/gsoc/images/dashboard 2.png
diff --git a/doc/proposals/2025/gsoc/images/dashboard1.png b/doc/proposals/2025/gsoc/images/dashboard1.png
diff --git a/doc/proposals/2025/gsoc/images/results.png b/doc/proposals/2025/gsoc/images/results.png
diff --git a/doc/proposals/2025/gsoc/poc_harsh_panchal_AI_API_EVAL.md b/doc/proposals/2025/gsoc/poc_harsh_panchal_AI_API_EVAL.md
@@ -0,0 +1,63 @@
+# AI API Evaluation Framework - Proof of Concept
+This is Proof of Concept (PoC) for AI API evaluation on a structured framework. It benchmarks AI language models against various performance metrics and provides actionable insights by means of easy-to-interpret visualizations. The PoC involves end-to-end integration from API calls to result analysis.
+
+## Objectives
+
+Benchmark the AI models (Falcon 7B, LLaMA 3.2) against others such as BLEU-4, ROUGE-L, BERTScore, METEOR.
+
+Use radar charts to provide a visual comparison.
+
+Facilitate effective monitoring of model performance by real-time latency and cost measurement.
+
+---
+##  Key Features Implemented
+Backend (FastAPI)
+Model Evaluation Endpoint: Tests AI models against given data sets.
+
+Scores such as BLEU-4, ROUGE-L, BERTScore, and METEOR are computed from Hugging Face models.
+
+Real-Time Performance Metrics: Tracks latency, cost, and processing time.
+
+Frontend (Flutter)
+Interactive Dashboard: Displays model scores in radar charts.
+
+Real-Time Data Display: Presents results of evaluation in a formatted way.
+
+Model Selection: Enables users to select from amongst available AI models.
+
+## Screenshots and Visuals
+![alt text](https://github.com/GANGSTER0910/apidash/blob/8fc7298824670397b07d0b42307bb1dd533af1fe/doc/proposals/2025/gsoc/images/dashboard1.png)
+
+![alt text](https://github.com/GANGSTER0910/apidash/blob/8fc7298824670397b07d0b42307bb1dd533af1fe/doc/proposals/2025/gsoc/images/dashboard%202.png)
+
+![alt text](https://github.com/GANGSTER0910/apidash/blob/8fc7298824670397b07d0b42307bb1dd533af1fe/doc/proposals/2025/gsoc/images/results.png)
+
+---
+##  Proof of Concept Details
+Models Evaluated: LLaMA 3.2 (3B) and Falcon 7B.
+
+Dataset: CNN Dailymail (3.0.0) used for benchmarking assessment.
+
+Evaluation Metrics
+
+BLEU-4: Scores n-gram overlap.
+
+ROUGE-L: Checks the longest consecutive matches.
+
+BERTScore: Scores contextual embeddings.
+
+METEOR: Considers synonyms, stemming, and grammar.
+
+To run the code 
+### Step 1: Clone the Repository
+```
+# Clone AI API Evaluation Repository
+
+git clone https://github.com/GANGSTER0910/AI_API_EVAL.git
+cd AI_API_EVAL
+
+# Install required Python packages
+pip install -r requirements.txt
+Provide your Hugging Face API token in the FastAPI.py file
+run python FastAPI.py
+```