This project analyzes the Titanic dataset and predicts passenger survival using multiple Machine Learning algorithms.
- Data Cleaning (handled missing values)
- Feature Engineering (FamilySize, encoding categorical variables)
- Exploratory Data Analysis
- Model Training and Evaluation
- Logistic Regression
- K-Nearest Neighbors (KNN)
- Decision Tree
| Algorithm | Accuracy |
|---|---|
| Logistic Regression | 81.0% |
| KNN | 81.5% |
| Decision Tree | 78.2% |
- Accuracy comparison bar chart
- Line graph of model performance
- KNN performed best on this dataset
- Feature engineering improved model accuracy
- Decision Tree showed signs of overfitting
titanic_survival_prediction.ipynb→ Main notebooktrain.csv,test.csv→ Datasetimages/→ Graphs and outputs
- Hyperparameter tuning
- Random Forest / XGBoost models
- Deployment using Streamlit
Built as part of a Machine Learning project