Skip to content

9kittenCo/spark-streaming-final-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Setup docker

  1. Setup docker for kafka cluster & Redis
docker-compose up kafka-cluster
  1. Run docker for kafka cluster
docker run --rm -it -v "$(pwd)":/data --net=host landoop/fast-data-dev bash

Setup Kafka Connect

  1. Build connector

Setup Kafka

  1. Create topic in kafka with 3 partitions
kafka-topics --create --topic events-standalone --partitions 3 --replication-factor 1 --zookeeper 127.0.0.1:2181
  1. Generate source data
python3 botgen_line.py -b 1 -u 1000 -n 100 -d 300 -f data/data_log.json
  1. Load source data to Kafka topic with Kafka Connect
connect-standalone worker.properties file-stream.properties