Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.
Updated Jun 13, 2024 - Jupyter Notebook
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.
A fully functional Hadoop YARN cluster as a docker-compose deployment.
More than 2,000 data engineer interview questions.
Big data project: information storage in HDFS.
A comprehensive solution for real-time football analytics, using Apache Spark on YARN for both streaming and batch processing, Hadoop HDFS for distributed storage, Kafka for real-time data ingestion, RethinkDB for live data updates, a custom-built search engine, and Next.js for data visualization.
ETL pipeline for Spar Nord Bank that analyzes the refill frequency of ATMs across Europe.
PyHDFS: Scalable & resilient distributed file system. Components: Zookeeper, NameNode, DataNode, Metadata service, Client. Setup guide for AWS & local. Explore distributed storage!
Hadoop Ecosystem - Building a Hadoop cluster environment for large-scale frequent pattern mining
ETL process
Docker image builds for Hadoop sandbox.
Netflix Filtering and Recommendation Project
Average Temperature - Hadoop - Mapper - Reducer
Leverage the power of Apache Spark for large-scale data processing and analysis
Big data analysis of a travel website (partial Ctrip data) - Hadoop course project (undergraduate coursework level)
Implementation of a pipeline for predicting Parkinson's disease using IoT, cloud, and big data tools
My first data analytics project, which I am creating alongside the Data Analytics Essentials course by Cisco Networking Academy.