A simple application that demonstrates features of the Spark Core and Spark SQL components.
Provides analytics related to Morning@Lohika events:
- unique participants by company
- most loyal participants
- participants by position
- etc.
Features:
- simple HTTP-based API
- file system: local and HDFS
- data formats: CSV and Parquet
- 3 compatible implementations based on: RDD (Spark Core), DataFrame DSL (Spark SQL), DataFrame SQL (Spark SQL)
- serialization: default Java and Kryo
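
To make the "3 compatible implementations" idea concrete, here is a minimal sketch of how one of the analytics above (unique participants by company) might be computed in each of the three styles. It is not taken from this project's code: the `Visit` record, its field names, the sample rows, and the object and method names are all hypothetical, and it assumes Spark 2.0 or later (for `SparkSession`) running in local mode with Kryo enabled through the standard `spark.serializer` setting.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.countDistinct

object UniqueParticipantsSketch {
  // Hypothetical record layout; the real data set may use different fields.
  final case class Visit(event: String, company: String, participant: String)

  // Made-up sample rows, used only to exercise the three variants.
  val sample: Seq[Visit] = Seq(
    Visit("event-1", "Lohika", "alice"),
    Visit("event-2", "Lohika", "alice"),
    Visit("event-2", "Lohika", "bob"),
    Visit("event-2", "Acme", "carol")
  )

  // 1. RDD (Spark Core): deduplicate (company, participant) pairs, then count per company.
  def viaRdd(spark: SparkSession, visits: Seq[Visit]): Map[String, Long] =
    spark.sparkContext
      .parallelize(visits)
      .map(v => (v.company, v.participant))
      .distinct()
      .map { case (company, _) => (company, 1L) }
      .reduceByKey(_ + _)
      .collect()
      .toMap

  // 2. DataFrame DSL (Spark SQL): the same aggregation expressed with column functions.
  def viaDsl(visits: DataFrame): Map[String, Long] =
    toMap(visits.groupBy("company").agg(countDistinct("participant").as("participants")))

  // 3. DataFrame SQL (Spark SQL): the same aggregation as a SQL query over a temp view.
  def viaSql(spark: SparkSession, visits: DataFrame): Map[String, Long] = {
    visits.createOrReplaceTempView("visits")
    toMap(spark.sql(
      "SELECT company, COUNT(DISTINCT participant) AS participants FROM visits GROUP BY company"))
  }

  // Collects a two-column (company, count) result to the driver as a Map.
  private def toMap(df: DataFrame): Map[String, Long] =
    df.collect().map(row => row.getString(0) -> row.getLong(1)).toMap

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("unique-participants-sketch")
      .master("local[*]")
      // Switches from default Java serialization to Kryo.
      .config("spark.serializer", "org.apache.spark.serializer.KryoSerializer")
      .getOrCreate()
    import spark.implicits._

    val df = sample.toDF()
    val results = Seq(viaRdd(spark, sample), viaDsl(df), viaSql(spark, df))

    // "Compatible" here means all three variants produce the same answer.
    assert(results.distinct.size == 1)
    println(results.head)
    spark.stop()
  }
}
```

Removing the `spark.serializer` line falls back to the default Java serialization, which is one way to compare the two serializers on the same job.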
TBD
## How to use
TBD
If you have any questions, please contact me directly via [email protected] or [email protected]