Skip to content

Commit c1cc4f0

Browse files
Create README.md
1 parent 16847da commit c1cc4f0

File tree

1 file changed

+32
-0
lines changed

1 file changed

+32
-0
lines changed
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,32 @@
1+
# Medical Insurance Cost Prediction 🚀
2+
3+
This project is part of **CodePeak 2025** and focuses on predicting medical insurance costs using machine learning.
4+
5+
## 📌 Project Overview
6+
The goal of this project is to analyze medical insurance data and build a predictive model to estimate insurance costs based on features such as age, BMI, smoking status, etc.
7+
8+
## 📂 Dataset
9+
- The dataset used for this analysis is **insurance.csv**.
10+
- It contains features such as age, sex, BMI, number of children, smoking status, and insurance charges.
11+
12+
## 🔧 Tech Stack
13+
- **Programming Language**: Python
14+
- **Libraries Used**:
15+
- **Data Processing**: Pandas, NumPy
16+
- **Visualization**: Matplotlib, Seaborn, Plotly
17+
- **Machine Learning**: Scikit-Learn, XGBoost, RandomForest
18+
- **Feature Engineering**: Ordinal Encoding, Label Encoding, Mutual Information
19+
- **Data Balancing**: SMOTE
20+
21+
## 📊 Exploratory Data Analysis (EDA)
22+
- Data cleaning and handling missing values
23+
- Feature correlations and distributions
24+
- Visualizations using Seaborn and Plotly
25+
26+
## 🏗️ Model Building
27+
- **Random Forest Regression**
28+
- **XGBoost Regression**
29+
- **Evaluation Metrics**:
30+
- Mean Squared Error (MSE)
31+
- R² Score
32+

0 commit comments

Comments
 (0)