jwszolek / cdc-replication-hadoop (Star 7). Keeps an RDB table in sync with a Hive structured store. Kafka was added as a buffer between the two tables. Topics: mysql, sync, kafka, spark, hive, hadoop, oracle, spark-streaming, cdc, change-data-capture, hql, redo-actions, mysql-connector, orcfile. Updated Feb 23, 2019. Python.
Michu-dev / big-data-first-project (Star 2). A first academic big data project that implements analysis using MapReduce and the Hive platform. Topics: airflow, hive, data-engineering, mapreduce-java, orcfile. Updated Jan 3, 2023. Java.
Yo-mah-Ya / File_Creator (Star 0). Creates files in formats such as "orc", "parquet", "xlsx", and "json" with Python. Topics: pandas, python3, parquet, parquet-files, orcfile. Updated Oct 4, 2023. Python.