Substation is a toolkit for routing, normalizing, and enriching security event and audit logs.
Updated Jun 4, 2024 - Go
Advanced and Fast Data Transformation in R
This repository offers Python scripts for efficiently processing data from cognitive tests like PVT, DSST, and Serial Addition, utilizing Streamlit for streamlined batch processing. It converts raw outputs into structured CSVs for comprehensive and individual analyses.
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Classify an email as spam or not spam using the logistic regression machine learning model. Additionally, analyze concrete compressive strength data using linear regression.
A terminal-based application for computing statistics based on Calgary's most popular licensed dog breeds dataset. Utilize Pandas DataFrame objects for data manipulation and analysis, including importing Excel data, hierarchical indexing, and processing data according to specifications.
The MDSplus data management system
SQL-like interface to tabular data structures
A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀
Python Stream Processing
Kubernetes-native platform to run massively parallel data/streaming jobs
Remote Sensing and GIS Software Library; Python module tools for processing spatial data.
CBRAIN is a flexible Ruby on Rails framework for accessing and processing large data on high-performance computing infrastructures.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
This project automates the scraping of news articles from the United Daily News (UDN) website, filters and processes them using specified keywords and OpenAI's GPT for Named Entity Recognition (NER), and exports the categorized data into a CSV file.
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
♿ Suite of open and standards-based tools for performing reliable accessibility conformance testing at scale
ESA Earth Observation Toolbox and Java Development Platform
Data and tools for generating and inspecting OLMo pre-training data.