AWS Cloudera Hadoop setup with H2O, Spark, MR
Updated Apr 24, 2017 - Java
COVID-19 data analysis with MapReduce
Big data technologies are software tools for analyzing, processing, and extracting information from data sets too large and complex for traditional data management tools to handle.
Applying MapReduce in Java on a Twitter dataset using Apache Hadoop
My portfolio | under development
🌟Spark Ceph Connector: Implementation of Hadoop Filesystem API for Ceph
A Python implementation of minimal MapReduce algorithms for Apache Spark
Learning Apache Hadoop for big data, and exploring MapReduce, Apache Spark RDDs, distributed processing, and stream processing
Hadoop, HBase, Phoenix, and ZooKeeper Integration
This repository contains all the material related to this big data certification.
Implementation of statistical methods using the Hadoop MapReduce library.
Big data pipeline for real-time sensor fusion and predictive analysis.
This repository develops a basic search engine that uses Hadoop's MapReduce framework to index and process large text corpora efficiently. The dataset is a 5.2 GB subset of the English Wikipedia dump. The project focuses on implementing a naive search algorithm to address challenges in information.
Logback appender for Apache Flume
Repository for the master's course Cloud Computing of the TU Berlin in the winter term 2020/21.
Installation and configuration of a small data lake