🚢 Titanic Survival Prediction using Logistic Regression

This project trains a logistic regression model to predict passenger survival on the Titanic. The data is preprocessed with Pandas and Scikit-Learn, and the model is evaluated with standard classification metrics and Seaborn plots.

📁 Dataset

- Source file: PreProccessing.Titanic.xlsx, a cleaned Excel workbook
- Target column: survived
- Dropped columns: name, ticket (not used as features)
- Missing feature values are imputed: median for numerical columns, most frequent value for categorical columns
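
The column handling described above can be sketched as follows. This is a minimal illustration, not the project's script: the helper name `split_features_target` and the sample rows are hypothetical, and the real workbook may contain more columns than shown here.

```python
import pandas as pd

TARGET = "survived"
DROPPED = ["name", "ticket"]  # columns the README says are removed


def split_features_target(df: pd.DataFrame):
    """Drop the unused columns and separate the features from the target."""
    features = df.drop(columns=DROPPED + [TARGET])
    target = df[TARGET]
    return features, target


# Made-up rows standing in for:
#   df = pd.read_excel("PreProccessing.Titanic.xlsx")  # needs openpyxl
sample = pd.DataFrame({
    "survived": [0, 1, 1],
    "pclass": [3, 1, 2],
    "name": ["A", "B", "C"],
    "sex": ["male", "female", "female"],
    "age": [22.0, None, 30.0],
    "ticket": ["T1", "T2", "T3"],
})

X, y = split_features_target(sample)
print(list(X.columns))  # ['pclass', 'sex', 'age']
```

Missing values (such as the empty age above) are left in place at this stage so that imputation can happen inside the preprocessing pipeline.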

🧠 Model Overview

Model: Logistic Regression
Preprocessing:
- Numerical features (pclass, age, sibsp, parch, fare): median imputation + scaling
- Categorical features (sex, embarked): mode imputation + one-hot encoding
Evaluation:
- Accuracy
- Confusion Matrix
- ROC Curve & AUC
- Classification Report

🔧 How to Run

1. Install Dependencies

pip install pandas matplotlib seaborn scikit-learn openpyxl

2. Prepare the Dataset

Place the PreProccessing.Titanic.xlsx file in the project directory.

3. Run the Script

python titanic_logistic_regression.py

📊 Outputs

Confusion Matrix

A heatmap showing true positives, true negatives, false positives, and false negatives.
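
As a small illustration of the four cells, the sketch below builds a confusion matrix from made-up labels with `sklearn.metrics.confusion_matrix`. The label lists are invented for the example and are not taken from the project's results.

```python
from sklearn.metrics import confusion_matrix

# Made-up ground truth and predictions (1 = survived, 0 = did not survive)
y_true = [0, 0, 0, 1, 1, 1, 1, 0]
y_pred = [0, 1, 0, 1, 0, 1, 1, 0]

# Rows are true classes, columns are predicted classes
cm = confusion_matrix(y_true, y_pred)

# For binary labels, ravel() yields the cells in this order
tn, fp, fn, tp = cm.ravel()
print(cm.tolist())  # [[3, 1], [1, 3]]
```

Passing `cm` to `seaborn.heatmap(cm, annot=True, fmt="d")` would draw the matrix as an annotated heatmap.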

ROC Curve

Plots the true positive rate against the false positive rate across decision thresholds and reports the area under the curve (AUC).
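
A minimal sketch of how the curve points and the AUC can be computed with Scikit-Learn. The four labels and scores are made up for illustration; in the project the scores would presumably come from the fitted model's predicted probabilities.

```python
from sklearn.metrics import roc_auc_score, roc_curve

# Made-up labels and predicted probabilities for the positive class
y_true = [0, 0, 1, 1]
y_score = [0.1, 0.4, 0.35, 0.8]

# One (fpr, tpr) point per decision threshold
fpr, tpr, thresholds = roc_curve(y_true, y_score)

# Area under the ROC curve: the fraction of (positive, negative) pairs
# in which the positive sample receives the higher score
auc = roc_auc_score(y_true, y_score)
print(auc)  # 0.75
```

Here three of the four positive/negative pairs are ranked correctly, which gives the AUC of 0.75.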

Classification Report

Detailed metrics: precision, recall, F1-score for both classes.
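
The per-class metrics can be reproduced on a toy example with `precision_recall_fscore_support`, which returns the same quantities the classification report prints. The labels below are invented for illustration.

```python
from sklearn.metrics import precision_recall_fscore_support

# Made-up labels: four passengers per class
y_true = [0, 0, 0, 0, 1, 1, 1, 1]
y_pred = [0, 0, 0, 1, 1, 1, 0, 0]

# Each returned array holds one value per class, ordered by label (0, then 1)
precision, recall, f1, support = precision_recall_fscore_support(y_true, y_pred)

# Class 0: 5 predicted, 3 correct -> precision 0.6; 3 of 4 found -> recall 0.75
# Class 1: 3 predicted, 2 correct -> precision 0.667; 2 of 4 found -> recall 0.5
print(precision.round(3).tolist())  # [0.6, 0.667]
print(recall.tolist())              # [0.75, 0.5]
```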

⚙️ Preprocessing Pipeline

- Built with ColumnTransformer and Pipeline
- Numerical and categorical data handled separately
- Improves modularity and reproducibility
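
One way such a pipeline might be assembled is sketched below. The column groups and imputation strategies follow the Model Overview; the choice of `StandardScaler`, `handle_unknown="ignore"`, `max_iter=1000`, the step names, and the toy rows are assumptions made for the example rather than details confirmed by the project.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

NUMERIC = ["pclass", "age", "sibsp", "parch", "fare"]
CATEGORICAL = ["sex", "embarked"]

# Numerical branch: median imputation, then scaling (StandardScaler assumed)
numeric_steps = Pipeline([
    ("impute", SimpleImputer(strategy="median")),
    ("scale", StandardScaler()),
])

# Categorical branch: most-frequent imputation, then one-hot encoding
categorical_steps = Pipeline([
    ("impute", SimpleImputer(strategy="most_frequent")),
    ("onehot", OneHotEncoder(handle_unknown="ignore")),
])

# Route each column group to its own branch
preprocess = ColumnTransformer([
    ("num", numeric_steps, NUMERIC),
    ("cat", categorical_steps, CATEGORICAL),
])

model = Pipeline([
    ("preprocess", preprocess),
    ("classifier", LogisticRegression(max_iter=1000)),
])

# Made-up rows with one missing age and one missing port of embarkation
toy = pd.DataFrame({
    "pclass": [1, 3, 2, 3, 1, 2],
    "age": [29.0, np.nan, 40.0, 18.0, 52.0, 33.0],
    "sibsp": [0, 1, 0, 0, 1, 0],
    "parch": [0, 0, 1, 0, 0, 2],
    "fare": [80.0, 7.9, 21.0, 8.1, 75.0, 26.0],
    "sex": ["female", "male", "female", "male", "male", "female"],
    "embarked": ["S", "S", np.nan, "Q", "C", "S"],
})
labels = [1, 0, 1, 0, 0, 1]

model.fit(toy, labels)
predictions = model.predict(toy)
```

Because imputation and scaling live inside the pipeline, their statistics are learned only from the data passed to `fit`, which keeps test rows from influencing the preprocessing.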

📈 Example Output

Accuracy: 0.81

Classification Report:
              precision    recall  f1-score   support

           0       0.84      0.88      0.86       105
           1       0.76      0.70      0.73        74

    accuracy                           0.81       179
   macro avg       0.80      0.79      0.79       179
weighted avg       0.81      0.81      0.81       179
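
To make the summary rows easier to read, the sketch below recomputes a few of them from the per-class precision, recall, and support values printed in the report above. It works from the rounded figures, so it is only a consistency check, not a reproduction of the underlying predictions.

```python
# Per-class values copied from the report: (precision, recall, support)
rows = {
    0: (0.84, 0.88, 105),
    1: (0.76, 0.70, 74),
}


def f1(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


total = sum(support for _, _, support in rows.values())

# Macro average: plain mean over classes
macro_precision = sum(p for p, _, _ in rows.values()) / len(rows)

# Weighted average: mean over classes, weighted by support
weighted_precision = sum(p * s for p, _, s in rows.values()) / total

print(total)                         # 179
print(round(f1(0.84, 0.88), 2))      # 0.86
print(round(f1(0.76, 0.70), 2))      # 0.73
print(round(macro_precision, 2))     # 0.8
print(round(weighted_precision, 2))  # 0.81
```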

📌 Features

- Clean and readable pipeline-based preprocessing
- Visualizations: Confusion matrix and ROC curve
- Performance metrics for evaluation
- Easy to extend with other models (e.g., SVM, Random Forest)
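
As a rough sketch of the last point, swapping the final pipeline step is enough to try a different classifier. The helper `build_model`, the two-feature toy data, and the hyperparameters are hypothetical; a real comparison would reuse the project's full preprocessing and a held-out test set.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC


def build_model(classifier):
    """Wrap any classifier in the same (simplified) preprocessing."""
    return Pipeline([
        ("scale", StandardScaler()),
        ("classifier", classifier),
    ])


candidates = {
    "logistic_regression": LogisticRegression(max_iter=1000),
    "svm": SVC(),
    "random_forest": RandomForestClassifier(n_estimators=100, random_state=0),
}

# Made-up (age, fare) rows and survival labels
X = [[22.0, 7.25], [38.0, 71.3], [26.0, 7.9],
     [35.0, 53.1], [54.0, 51.9], [2.0, 21.1]]
y = [0, 1, 1, 1, 0, 0]

# Training accuracy only, to show that the models are interchangeable
scores = {}
for name, classifier in candidates.items():
    model = build_model(classifier)
    model.fit(X, y)
    scores[name] = model.score(X, y)
```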
