vmieres/Natural-Language-Processing-NLP

Natural-Language-Processing-NLP-

This repo applies natural language processing to gauge the sentiment of the latest news articles featuring Bitcoin and Ethereum. It also applies fundamental NLP techniques to explore other factors that may relate to the coin prices, such as common words and phrases and the organizations and entities mentioned in the articles.

Sentiment Analysis

  • Which coin had the highest mean positive score? - Bitcoin

  • Which coin had the highest negative score? - Ethereum

  • Which coin had the highest positive score? - Ethereum
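The questions above distinguish a mean positive score from a highest positive score, which is why two different coins can be the answer. The sketch below illustrates that distinction under stated assumptions: it reads "highest positive score" as the maximum over single articles, and the per-article scores, the `positive_scores` dictionary, and the `summarize` helper are hypothetical placeholders, not the repo's data or code.

```python
from statistics import mean

# Hypothetical per-article positive scores in [0, 1].
# Placeholder values for illustration only, not the repo's results.
positive_scores = {
    "bitcoin": [0.10, 0.05, 0.12],
    "ethereum": [0.00, 0.02, 0.20],
}


def summarize(scores):
    """Return the mean and maximum positive score for each coin."""
    return {
        coin: {"mean": mean(values), "max": max(values)}
        for coin, values in scores.items()
    }


summary = summarize(positive_scores)

# Coin with the highest average positive score across its articles.
highest_mean = max(summary, key=lambda coin: summary[coin]["mean"])

# Coin with the single most positive article (assumed reading of "highest").
highest_max = max(summary, key=lambda coin: summary[coin]["max"])

print(highest_mean, highest_max)
```

With these placeholder values, one coin has the steadier average while the other has the single most positive article, which is the same pattern the answers above report.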

Natural Language Processing

  • Use NLTK to produce the n-grams for N = 2 (bigrams).
  • List the top 10 words for each coin.

Bitcoin

  1. 'bitcoin', 16
  2. 'reuters', 12
  3. 'cryptocurrency', 5
  4. 'year', 5
  5. 'november', 5
  6. 'currency', 5
  7. 'ruvic', 5
  8. 'virtual', 4
  9. 'day', 3
  10. 'reaching', 3

Ethereum

  1. 'reuters', 28
  2. 'bitcoin', 15
  3. 'currency', 12
  4. 'virtual', 11
  5. 'ruvic', 10
  6. 'london', 7
  7. 'november', 6
  8. 'new', 6
  9. 'york', 5
  10. 'ethereum', 4
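
Word counts and bigram counts like those listed above could be produced along the following lines. This is a minimal sketch, not the repo's notebook code: the regex tokenizer, the tiny `STOPWORDS` set, the sample sentence, and the helper names `tokenize`, `top_words`, and `bigram_counts` are all assumptions made for illustration. It uses `nltk.util.ngrams` when NLTK is installed and falls back to an equivalent pure-Python version otherwise.

```python
import re
from collections import Counter

try:
    from nltk.util import ngrams  # NLTK's n-gram helper
except ImportError:
    # Fallback that yields the same n-gram tuples when NLTK is not installed.
    def ngrams(tokens, n):
        return zip(*(tokens[i:] for i in range(n)))

# Tiny illustrative stop word list (an assumption; a fuller list would be
# used in practice).
STOPWORDS = {"the", "a", "of", "in", "and", "to", "on", "as"}


def tokenize(text):
    """Lowercase the text, keep alphabetic words, and drop stop words."""
    words = re.findall(r"[a-z]+", text.lower())
    return [word for word in words if word not in STOPWORDS]


def top_words(tokens, n=10):
    """Return the n most frequent words as (word, count) pairs."""
    return Counter(tokens).most_common(n)


def bigram_counts(tokens):
    """Count the n-grams for N = 2 (bigrams)."""
    return Counter(ngrams(tokens, 2))


# Hypothetical sample text standing in for the article contents.
sample = "Bitcoin rose on Monday as the virtual currency bitcoin hit a new high"
tokens = tokenize(sample)

print(top_words(tokens, 10))
print(bigram_counts(tokens).most_common(5))
```

Running each coin's combined article text through `tokenize` and then `top_words` would give a ranked list in the same word and count form as the lists above.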

Named Entity Recognition

Build a named entity recognition model for each coin and visualize the tags using spaCy.