This project focuses on exploring and predicting mango fruit defect detection system (DDS) outcomes using machine learning models. A comprehensive analysis was performed on a dataset sourced from Kaggle, with model comparison based on multiple evaluation metrics.
- Source: MangoFruitDDS Dataset on Kaggle
- The dataset contains features related to mango fruit attributes and defect classifications.
The following classification models were applied and compared:
- Logistic Regression
- Ridge Regression
- Lasso Regression
- Naive Bayes
- K-Nearest Neighbors (KNN)
- Support Vector Machine (SVM)
Models were evaluated using:
- AUC-ROC Curve
- Precision-Recall Curve
- Confusion Matrix
- Box Plots
- K-Nearest Neighbors (KNN) achieved the highest predictive performance across most evaluation metrics.
- Logistic Regression followed closely, offering strong performance with interpretability.
- Comparative model analysis for classification on mango defect detection.
- Visual performance insights using AUC-ROC, PR curve, confusion matrix, and box plots.
- Hyperparameter tuning and regularization explored with Ridge and Lasso regression.
- Python (Pandas, NumPy, Scikit-learn, Matplotlib, Seaborn)
- Jupyter Notebook
- Clone this repository.
- Install dependencies using
pip install -r requirements.txt
. - Open the Jupyter notebook and run each cell step-by-step.
β If you found this helpful, feel free to star the repository!