Wavelet Feature Extraction for Thoracic Diseases Detection

Project Description

This project delves into the potential of wavelet transforms in digital image processing, targeting Chest X-Ray (CXR) scans for thoracic disease detection. Wavelets, mathematical functions that divide signals into time and frequency components, stand as powerful tools in refining image quality and revealing intricate details. Their strength lies in efficiently handling non-stationary signals, making them indispensable for medical imaging, especially in CXR classification. The project originated from the motivation to develop faster diagnostic methods for identifying COVID-19 and evolved to demonstrate a significant reduction in computational costs while maintaining high accuracy levels.

Features

Wavelet-based feature extraction from CXR images.
Dataset generation for different wavelet configurations.
Training and evaluation of machine learning models, including RandomForest, XGBoost, and Logistic Regression.
Ablation studies by zeroing out wavelet features.

Architecture

Getting Started

Prerequisites

Ensure you have conda installed.

Setting up the Environment

Clone the repository:

git clone https://github.com/AmiteshBadkul/WaveletCXR
cd WaveletCXR

Create a conda environment using the provided environment.yml file:
```
cd environment
conda env create -f environment.yml
```
Activate the environment:
```
conda activate waveletCXR
```

Data

Here is the link to the dataset --> Thoracic Disease Classification

Usage

To generate a dataset with wavelet transformed features from CXR images:

# Usage: dataset.py [OPTIONS]

# Directory containing the CXR images
--input_dir="/path/to/images"

# Directory where the generated datasets will be saved
--output_dir="/path/to/output"

# Type of wavelet used for dataset generation (default: "bior2.4")
--wavelet_type="bior2.4"

# Decomposition level used for dataset generation (default: 1)
--level=1

# Example command:
python dataset.py --input_dir "/path/to/images" --output_dir "/path/to/output" --wavelet_type "bior2.4" --level 1

For automated dataset generation with different wavelet configurations:
```
python dataset_generation.py
```

To train and evaluate a model based on the provided dataset and algorithm:

# Usage: main.py [OPTIONS]

# Type of wavelet used for dataset generation (default: "bior2.4")
--wavelet_type="bior2.4"

# Decomposition level used for dataset generation (default: 1)
--level=1

# Directory containing the datasets
--input_dir="/path/to/dataset"

# Algorithm to use for training (Choices: 'RF', 'XGBoost', 'Logistic')
--algorithm="RF"

# Number of trees in the forest (for RF) or Number of boosting rounds (for XGBoost). Default is 100.
--n_estimators=100

# Maximum depth of the tree. Default is None.
--max_depth

# Learning rate for the algorithm (for XGBoost). Default is 0.1.
--learning_rate=0.1

# Maximum number of iterations for convergence (used in Logistic Regression). Default is 100.
--max_iter=100

# Directory to save the config, trained model, and results
--output_dir="/path/to/results"

# Flag to conduct an ablation study by zeroing out wavelet features (default: "False").
--ablation="False"

# Example command:
python main.py --input_dir "/path/to/dataset" --output_dir "/path/to/results" --algorithm "RF"

For automated training with different wavelet configurations:
```
python all_run.py
```
For automated analysis of the results:
```
python analysis.py
```

Additional Notes:

The trainer.py script provides functionalities for training and evaluating models based on the wavelet-processed dataset.
The utils.py script contains functions for measuring the computational load.
The code/cnn/ folder contains the files associated with running and storing the various CNN models with functionality to save the performance and computational load.

Acknowledgments

This project was carried out under the supervision of Dr. Sudha Radhika at BITS Pilani, Hyderabad.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
code		code
environment		environment
.gitignore		.gitignore
README.md		README.md
architecture.png		architecture.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Wavelet Feature Extraction for Thoracic Diseases Detection

Project Description

Features

Architecture

Getting Started

Prerequisites

Setting up the Environment

Data

Usage

Additional Notes:

Acknowledgments

About

Releases

Packages

Languages

AmiteshBadkul/WaveletCXR

Folders and files

Latest commit

History

Repository files navigation

Wavelet Feature Extraction for Thoracic Diseases Detection

Project Description

Features

Architecture

Getting Started

Prerequisites

Setting up the Environment

Data

Usage

Additional Notes:

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages