Skip to content
View matteofortier's full-sized avatar
🧗
🧗

Block or report matteofortier

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
matteofortier/README.md

Hi there, I am Matteo 👋

I am a MSci Computer Science student studying at King's College London specialising in AI and Machine Learning!

Contact me via email at [email protected] or connect with me on LinkedIn! 👥💬

Check out my favourite personal projects!

Steam Games' Genre Multi-Label Classifier (Deep Learning)

  • Ingested 40,000 games to produce a dataset of 10,000 video game screenshots and 13 labels.
  • Developed custom tensor generators to support multi-label data and optimisations such as dataset cache-ing.
  • Implemented image augmentation techniques to reduce overfitting, improving model macro-f1 score by 0.3.
  • Developed a tailored macro-f1 metric and soft-macro-f1 loss function to score and train multi-label classification.
  • Modelled a CNN video-game genre labeller using TensorFlow and VGG-16 achieving a macro-f1 score of 0.54.

Recipe Recommender (NLP, Unsupervised Learning)

  • Ingested and pre-processed 1 million recipes to extract ingredients using NLTK, SpaCy, and Google Compute Platform.
  • Designed a topic modelling pipeline using TFIDF vectorizer and NMF to produce 20-100 intuitive topics.
  • Deployed a web-app using Streamlit that recommends the top 15 most similar recipes given an inputted/selected recipe.

Steam Game Success Classifier (Classification)

  • Produced a dataset of 40,000+ rows with 29 features from disparate sources to use for classification.
  • Automated tuning of 200 combinations of hyper-parameters using GridSearchCV.
  • Evaluated the performance of classification models such as XGBoost to achieve AUC of 0.922.

Twitch Stats Dashboard (Data Engineering)

  • Automated data collection from Twitch API to a MongoDB Atlas instance every 30 minutes.
  • Designed 8 interactive (twitch) data visualisations using Plotly and Dash.
  • Deployed a web-app on Heroku to allow for quick and easy access to live and historical data about Twitch and Twitch Partners.

University Course Satisfaction Predictor (Regression)

Exploring the Relation Between Yankees Home Games and MTA Subway Stations (EDA)

Popular repositories Loading

  1. ENG_PROJECT ENG_PROJECT Public

    Jupyter Notebook 3

  2. CLS_PROJECT CLS_PROJECT Public

    Jupyter Notebook 2

  3. NLP_PROJECT NLP_PROJECT Public

    Jupyter Notebook 2

  4. DL_PROJECT DL_PROJECT Public

    Jupyter Notebook 2

  5. MTA_EDA_Project MTA_EDA_Project Public

    Jupyter Notebook 1

  6. LR_PROJECT LR_PROJECT Public

    Jupyter Notebook 1