Skip to content

Commit

Permalink
Uploading new version of X-ray CNN model project
Browse files Browse the repository at this point in the history
  • Loading branch information
roboswell committed Jun 20, 2024
1 parent f0fe7c4 commit 10c7770
Showing 1 changed file with 10 additions and 3 deletions.
13 changes: 10 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,7 @@
- Protection of Civilians (PoC) Team Intern, Division of Policy, Evaluation, and Training @ United Nations Department of Peace Operations (UN DPO) - New York, NY - 2019
- Refugee Status Determination (RSD) Intern @ United Nations High Commissioner for Refugees (UNHCR) - Cairo, Egypt - 2016

### Portfolio Projects
### Conflict Data Science Projects
- GIS & Spatial Analysis:
- **Title:** ["Examining Geospatial Covariate Relationships with Civilian Killings in South Sudan’s Civil War"](./GIS/GIS_covariate_relationships-killings-south_sudan.html)
- Focus: Geospatial Statistics
Expand All @@ -25,13 +25,20 @@
- Technology Used: R Markdown, ggmap, tmap, sp, sf, rgdal, stadia/stamen maps, plotly, dplyr, ggplot2, ggthemes, ggpubr, stringr, scales, and kableExtra
- Contents: Interactive and static charts, GIS maps, heat map tables, density maps, and union buffers and using statistics within them
- Natural Language Processing (NLP):
- **Title:** ["Using News Articles on Events in Nigeria in 2019 to Predict Political Violence"](./NLP/Using_Nigerian_News-based_ML_Models_to_Predict_Political_Violence.html)
- **Title:** ["Using News Articles to Predict Political Violence in Nigeria"](./NLP/Using_Nigerian_News-based_ML_Models_to_Predict_Political_Violence.html)
- Focus: NLP for Time-Series Forecasting
- Technology Used: Python, TF-IDF word transformation, NLTK library, Scikit-Learn machine learning models, Scikit-Learn's TimeSeriesSplit, Augmented Dickey-Fuller Test, first-differencing, lags
- Contents: Converting news articles by publishing date into time-series machine elarning forecasting models. Performance comparison between Ridge, Lasso, Random Forest, and XGBoost regression models
- **Title: "LDA Topic Modeling & VADER Sentiment Analysis for Political News Articles on Events Related to Nigeria in 2019"**
- **Title: "LDA Topic Modeling & VADER Sentiment Analysis for Political News Articles on Nigeria"**
- [Primary document (Python)](./NLP/Nigeria_News_LDA_&_Sentiment_Analysis.html)
- [Visualization for the project (R)](./NLP/Nigeria_News_Sentiment_Analysis-Viz-Created_in_R.html)
- Focus: Topic Modeling & Sentiment Analysis
- Technology Used: Python, R Markdown, Excel, NLTK for stopwords, PorterStemmer, and PunktSentenceTokenizer, gensim library for CoherenceModel, LdaModel, and corpora, Jaccard similarity, vaderSentiment library, itertools, ggplot2
- Contents: Text data cleaning, Latent Dirichlet Allocation (LDA) topic modeling of Nigerian news article text, VADER (Valence Aware Dictionary for Sentiment Reasoning) sentiment analysis scores for articles containing specific political words, compared across quarters of the year.

### Non-Conflict Data Science Projects
- Deep Learning for Medical Imaging:
- **Title:** ["Comparative Analysis of Deep Learning Models for X-ray Illness Classification"](./Neural%20Network%20Models/X-Ray%20Deep%20Learning%20Classificaton%20Models.html)
- Focus: Deep Learning for Image Classification
Technology Used: Python, Keras, CNNs, Transfer Learning, ImageDataGenerator, flow_from_directory, EarlyStopping, ModelCheckpoint, ReduceLROnPlateau
- Contents: Comparing the performance of 8 CNN deep learning models on X-ray images from three classes (COVID-19, viral pneumonia, and healthy). These include transfer learning models (e.g., InceptionV3), and various techniques to improve model generalization and help avoid overfitting (e.g., dropout, batch normalization, early stopping, data augmentation, L1 and L2 regularization, fire modules, and ways of using deep networks effectively). I also demonstrate best practices for structuring filters/kernels, channels, layers, activation functions, pooling, convolutional blocks, and other model components for optimal performance. Metrics include confusion matrixes, accuracy, precision, recall, F1-score, ROC curve, and AUC. Analysis of non-augmented vs. augmented data models with specific augmentation techniques are shown. Architectures and training strategies for each model are detailed.

0 comments on commit 10c7770

Please sign in to comment.