Here are
444 public repositories
matching this topic...
Scala code to read Parquet files as streams in Spark Streaming using Avro.
Updated
May 5, 2016
Scala
Experiment with Apache Parquet and Apache Avro
How to use Parquet in Flink
Parquet demo project for the Workshop in the Course DIS. Benchmarks Parquet versus ORC, JSON and CSV
A light Kafka to HDFS/S3 ETL library based on Apache Spark
Updated
Jun 29, 2017
Scala
A collection of Spark/Scala example programs
Dump RDBMS table data into a parquet
Updated
Aug 28, 2017
Scala
Annual Revenue Vs. Executive Pay for Recipients of U.S. Federal Funds; uses Scala Spark in Zeppelin notebook.
apache-parquet-kotlin-python-survey
Updated
Sep 28, 2017
Python
Yet Another Avro CLI Tool
Updated
Nov 20, 2017
Java
Test Apache Spark with parquet/root/binary IO vs c++ processing
Updated
Nov 22, 2017
Scala
Ingest a CSV file and store it in Parquet format with SBT
Updated
Jan 10, 2018
Scala
Updated
Jan 18, 2018
Scala
This repo is a scala project that helps convert to and from hadoop file formats without having to use a hadoop cluster i.e. in local mode
file format specific benchmarks for Parquet, ORC, Avro, JSON, and Arrow
Updated
Feb 6, 2018
Scala
Improve this page
Add a description, image, and links to the
parquet
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
parquet
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.