Jupyter notebook server prepared for running Spark with Scala kernels on a remote Spark master
-
Updated
Apr 25, 2020 - Jupyter Notebook
Jupyter notebook server prepared for running Spark with Scala kernels on a remote Spark master
Your go-to-cheatsheet to learn apache-Hadoop.
Hadoop in docker cluster, created by docker-compose. Create Hadoop cluster in less than 5mins.
Modern Big Data Analysis: recommend which pair of United States airports should be connected with a high-speed passenger rail tunnel.
Hdfs Block Storage System
Apache Spark with HDFS cluster within Kubernetes
Simplified Hadoop Setup and Configuration Automation
News Sentiment Analysis using ETL pipeline
In this project we have used comments from reddit to play around with multiple functionalities of Apache Spark, HDFS and Docker.
An example of installation Apache Spark on AWS
Add a description, image, and links to the hdfs-cluster topic page so that developers can more easily learn about it.
To associate your repository with the hdfs-cluster topic, visit your repo's landing page and select "manage topics."