Cirrhosis Survival Prediction

This repository contains a machine learning project aimed at predicting the survival status of patients diagnosed with liver cirrhosis. Using a combination of data preprocessing techniques and machine learning models, this project provides a framework for accurate survival prediction based on clinical and laboratory features.

Introduction

Liver cirrhosis is a chronic condition that significantly impacts patient survival. This project leverages machine learning to classify survival status into three categories:

C: Compensated Cirrhosis
D: Decompensated Cirrhosis
CL: Chronic Liver Disease

The goal is to provide an automated and efficient way to assist clinicians in decision-making and resource allocation.

Dataset

Source

The dataset includes clinical and laboratory data for patients diagnosed with liver cirrhosis.

Structure

Training Data: 224 samples with 19 features (including 'Status').
Test Data: 88 samples with 18 features (excluding 'Status').

Preprocessing

Imputation of missing values using median values.
One-hot encoding for categorical features.
Balancing the target labels using SMOTE (Synthetic Minority Oversampling Technique).

Methodology

Steps

Data Cleaning and Preprocessing: Handled missing values and categorical encoding.
Class Balancing: Addressed label imbalance using SMOTE.
Model Training and Evaluation:
- Logistic Regression
- Random Forest
- Support Vector Machine (SVM)
Prediction: Used the best-performing model to generate test predictions.

Models Used

Logistic Regression

A simple baseline model offering interpretability.

Random Forest

Robust against overfitting and effective for non-linear relationships.

Support Vector Machine (SVM)

Versatile and effective in high-dimensional feature spaces.

Results

Model Performance

The models were evaluated on a validation set:

Random Forest achieved the best results with an accuracy of 82%, providing strong recall for the minority class ('CL').

Test Predictions

Predictions for the test dataset were saved to a file: test_predictions.csv.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
code.ipynb		code.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cirrhosis Survival Prediction

Introduction

Dataset

Source

Structure

Preprocessing

Methodology

Steps

Models Used

Logistic Regression

Random Forest

Support Vector Machine (SVM)

Results

Model Performance

Test Predictions

About

Releases

Packages

Languages

atacolak/ML_Cirrhosis-Survival-Prediction

Folders and files

Latest commit

History

Repository files navigation

Cirrhosis Survival Prediction

Introduction

Dataset

Source

Structure

Preprocessing

Methodology

Steps

Models Used

Logistic Regression

Random Forest

Support Vector Machine (SVM)

Results

Model Performance

Test Predictions

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages