Skip to content

NLP on customer tweets to Apple Support to uncover topics using NMF (unsupervised modeling), and classify tweets as product types based on users' initial tweets using CorEx (semi-supervised modeling)

License

Notifications You must be signed in to change notification settings

syntheticjohn/apple_tweets_nlp

Repository files navigation

Text Classification On Customer Tweets To Apple Support

NLP on customer tweets sent to Apple Support to uncover topics over time using NMF (unsupervised topic modeling), and classify tweets as product types based on users' initiating tweets using CorEx with product-focused anchors (semi-supervised topic modeling)

Project overview:

  • Explored topics from ~100k customer tweets to Apple Support with NLP using NLTK for text cleaning, mongoDB for storage and NMF for topic modeling
  • Classified the product type based on a user's first tweet using CorEx with product-focused anchors in order to automatically route tweets to product-aligned teams within Apple customer support group

This repo includes:

  • apple_tweets_preprocessing.py: data preprocessing
  • apple_tweets_modeling.ipynb: topic modeling using NMF and CorEx
  • data: pickled files
  • apple_tweets_nlp_slides.pdf: pdf of project presentation slides

Note 1: The proprocessing python script is designed to run where mongodb has already been setup and pre-stored with the tweet data (sourced from: kaggle.com/thoughtvector/customer-support-on-twitter)
Note 2: Some data files were excluded from the data folder due to github's size limitation

About

NLP on customer tweets to Apple Support to uncover topics using NMF (unsupervised modeling), and classify tweets as product types based on users' initial tweets using CorEx (semi-supervised modeling)

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published