A curated list of awesome big data frameworks, resources and other awesomeness.
Updated May 7, 2024
Greenplum Database - Massively Parallel PostgreSQL for Analytics. An open-source massively parallel data platform for analytics, machine learning and AI.
COVID-19/2019-nCoV Infection Time Series Data Warehouse
A few projects related to data engineering, including data modeling, cloud infrastructure setup, data warehousing, and data lake development.
The data warehouse for operational workloads.
Open source SQL Query Assistant service for Databases/Warehouses
A powerful open source data warehouse system
[NOT MAINTAINED] Light-weight Python OLAP framework for multi-dimensional data analysis
Privacy- and security-focused Segment alternative, written in Golang and React
Useful scripts, UDFs, views, and other utilities for migration and data warehouse operations in BigQuery.
Personal Data Engineering Projects
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
TensorBase is a new big data warehouse built with modern efforts.
The ix modeling platform for integrated and cross-cutting scenario analysis
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
DomainMOD is an open source application written in PHP & MySQL used to manage your domains and other internet assets in a central location. DomainMOD also includes a Data Warehouse framework that allows you to import your web server data so that you can view, export, and report on your live data.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
🐳 Tool to automate data quality checks on data pipelines
Use sample content to explore SAP Datasphere. The downloads contain sample data as CSV files, but could also include model / metadata information. See the README files for details.