Udacity_AI_healthcare_P1_PneumoniaDetection

Pneumonia Detection from Chest X-rays and FDA submission

Problem description

The task is to analyze data from the NIH Chest X-ray Dataset and train a CNN to classify a given chest x-ray for the presence or absence of pneumonia. The project will culminate in a model that can predict the presence of pneumonia with human radiologist-level accuracy that can be prepared for submission to the FDA for 510(k) clearance as software as a medical device. As part of the submission preparation, we need to formally describe the model, the data that it was trained on, and a validation plan that meets FDA criteria.

The Dataset

NIH Chest Xray Dataset - 112,000 chest x-rays with disease labels acquired from 30,000 patients. The disease labels were created using Natural Language Processing (NLP) to mine the associated radiological reports. The labels include 14 common thoracic pathologies:

Atelectasis
Consolidation
Infiltration
Pneumothorax
Edema
Emphysema
Fibrosis
Effusion
Pneumonia
Pleural thickening
Cardiomegaly
Nodule
Mass
Hernia The biggest limitation of this dataset is that image labels were NLP-extracted so there could be some erroneous labels but the NLP labeling accuracy is estimated to be >90%.

The original radiology reports are not publicly available but you can find more details on the labeling process here.

Dataset Contents: 112,120 frontal-view chest X-ray PNG images in 1024*1024 resolution (under images folder) Meta data for all images (Data_Entry_2017.csv): Image Index, Finding Labels, Follow-up #, Patient ID, Patient Age, Patient Gender, View Position, Original Image Size and Original Image Pixel Spacing.

Getting Started - installations

Python packages

python 3.6
numpy
tensorflow
keras
pandas
matplotlib
sklearn

Execution

This project would require GPUs to train the deep learning model.

Note: I have not uploaded the hdf5 file, the weights of the trained model since its a big file.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Build and train model.ipynb		Build and train model.ipynb
EDA.ipynb		EDA.ipynb
FDA_Submission.pdf		FDA_Submission.pdf
Inference.ipynb		Inference.ipynb
README.md		README.md
my_model.json		my_model.json
sample_labels.csv		sample_labels.csv
test1.dcm		test1.dcm
test2.dcm		test2.dcm
test3.dcm		test3.dcm
test4.dcm		test4.dcm
test5.dcm		test5.dcm
test6.dcm		test6.dcm

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Udacity_AI_healthcare_P1_PneumoniaDetection

Problem description

The Dataset

Getting Started - installations

Execution

About

Releases

Packages

Languages

sulagnag/Udacity_AI_healthcare_P1_PneumoniaDetection

Folders and files

Latest commit

History

Repository files navigation

Udacity_AI_healthcare_P1_PneumoniaDetection

Problem description

The Dataset

Getting Started - installations

Execution

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages