# streaming-data

Here are 393 public repositories matching this topic...

oxnr / awesome-bigdata

A curated list of awesome big data frameworks, resources and other awesomeness.

data-science data awesome database data-stream bigdata series-database data-visualization data-warehouse stream-processing data-analytics awesome-list distributed-database visualize-data streaming-data

Updated May 7, 2024

provectus / kafka-ui

Open-Source Web UI for Apache Kafka Management

opensource kafka big-data web-ui streams kafka-connect apache-kafka kafka-producer kafka-client kafka-streams hacktoberfest streaming-data kafka-manager kafka-cluster event-streaming cluster-management kafka-ui kafka-brokers

Updated May 16, 2024
Java

benthosdev / benthos

Fancy stream processing made operationally mundane

go golang kafka cqrs etl rabbitmq amqp logs message-bus event-sourcing nats stream-processing message-queue data-engineering streaming-data stream-processor data-ops

Updated May 20, 2024
Go

online-ml / river

🌊 Online machine learning in Python

python data-science machine-learning streaming stream-processing online-learning streaming-data concept-drift real-time-processing incremental-learning online-machine-learning online-statistics

Updated May 21, 2024
Python

MaterializeInc / materialize

The data warehouse for operational workloads.

rust distributed-systems streaming sql database kafka postgresql data-warehouse stream-processing streaming-data materialized-view postgresql-dialect operational-data-warehouse

Updated May 21, 2024
Rust

pravega / pravega

Pravega - Streaming as a new software-defined storage primitive

streaming distributed-storage real-time-data streaming-data data-ingestion

Updated Apr 2, 2024
Java

piskvorky / smart_open

Utils for streaming large files (S3, HDFS, gzip, bz2...)

python streaming s3 file hdfs hacktoberfest webhdfs boto gzip-stream bz2 streaming-data

Updated May 8, 2024
Python

johnkerl / miller

Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON

Updated May 20, 2024
Go

Stratio / sparta

Real Time Analytics and Data Pipelines based on Spark Streaming

workflow lambda streaming real-time scala kafka spark analytics spark-streaming olap hdfs sparksql triggers streaming-data stratio sparta stratio-sparta

Updated Oct 24, 2019
Scala

infinyon / fluvio

Lean and mean distributed stream processing system written in Rust and WebAssembly.

rust distributed-systems streaming real-time serverless webassembly data-flow stream-processing data-integration cloud-native data-pipelines stateful streaming-data stream-processing-engine event-driven-architecture streaming-data-processing streaming-data-pipelines

Updated May 20, 2024
Rust

scikit-multiflow / scikit-multiflow

A machine learning package for streaming data in Python. The other ancestor of River.

machine-learning stream scikit-learn streaming-data scikit moa meka

Updated Nov 2, 2023
Python

bbejeck / kafka-streams-in-action

Source code for the Kafka Streams in Action Book

streaming kafka stream-processing streaming-data kafkastreams

Updated Jul 11, 2021
Java

infoslack / awesome-kafka

A list about Apache Kafka

infrastructure kafka apache-spark stream-processing apache-kafka kafka-streams data-processing data-pipeline streaming-data

Updated Feb 9, 2024

reugn / go-streams

A lightweight stream processing library for Go

Updated May 14, 2024
Go

python-streamz / streamz

Real-time stream processing for Python

python real-time async streaming-data

Updated Dec 22, 2022
Python

microsoft / Trill

Trill is a single-node query processor for temporal or streaming data.

streaming-data temporal-data

Updated Jan 8, 2024
C#

Chulong-Li / Real-time-Sentiment-Tracking-on-Twitter-for-Brand-Improvement-and-Trend-Recognition

A real-time interactive web app based on data pipelines using streaming Twitter data, automated sentiment analysis, and MySQL and PostgreSQL databases (deployed on Heroku)

twitter dashboard tweets plotly stream-processing dash data-analysis topic-tracking twitter-sentiment-analysis streaming-data heroku-server brand-improvement

Updated May 11, 2020
Jupyter Notebook

kLabUM / rrcf

🌲 Implementation of the Robust Random Cut Forest algorithm for anomaly detection on streams

python machine-learning tree random-forest outliers streaming-data anomaly-detection detect-outliers robust-random-cut-forest

Updated Feb 24, 2024
Python

readysettech / readyset

Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, Readyset stores the results of cached SELECT statements and incrementally updates them as the underlying data changes.

mysql rust caching postgres sql backend cache postgresql databases rust-lang mysql-database postgresql-database streaming-data caching-proxy

Updated May 20, 2024
Rust

guillermo-navas-palencia / optbinning

Optimal binning: monotonic binning with constraints. Supports batch and stream optimal binning, scorecard modelling, and counterfactual explanations.

python stream optimization binning batch-processing credit-scoring scorecard streaming-data woe woebinning counterfactual-explanations mdlp

Updated Mar 25, 2024
Python
