  • Disney Streaming
  • San Francisco Bay Area, CA

jiegzhan/README.md

Hi there 👋

  • 🔭 I am Zhang Jie (张 杰).

  • In 05/2024, I joined Disney Streaming as a Staff Software Engineer, where I mainly focus on data ingestion and data governance.

  • I was a Senior Software Engineer on the Roku Big Data Platform team for 4.5 years, where I provided data infrastructure solutions for both large-scale real-time stream processing and data lake batch processing.

  • Tech Stack: Flink, Spark, Kafka, Presto, Iceberg, Hive, Hadoop, Airflow, Kubernetes, Docker, AWS Stack, Datadog, Jupyter Notebook, Superset, Looker.

Real Time Streaming Processing ✅

Built a Flink- and Kubernetes-powered real-time streaming platform that lets teams build Flink streaming applications and run them seamlessly on Kubernetes clusters. Onboarded other engineering teams and promoted streaming best practices.

Data Lake Batch Processing ✅

Built and maintained a data lake based on Spark, Hive, S3, and Airflow. Architected and implemented distributed data ingestion and processing pipelines.

Pinned

  1. multi-class-text-classification-cnn Public

    Classify Kaggle Consumer Finance Complaints into 11 classes. Build the model with a CNN (Convolutional Neural Network) and word embeddings on TensorFlow.

    Python 426 198

  2. multi-class-text-classification-cnn-rnn Public

    Classify Kaggle San Francisco Crime Descriptions into 39 classes. Build the model with a CNN, an RNN (GRU and LSTM), and word embeddings on TensorFlow.

    Python 599 262

  3. image-classification-rnn Public

    Classify the MNIST image dataset into 10 classes. Build an image classifier with a Recurrent Neural Network (RNN: LSTM) on TensorFlow.

    Python 87 49

  4. apache/hudi Public

    Upserts, Deletes And Incremental Processing on Big Data.

    Java 5.4k 2.4k

  5. prestodb/presto Public

    The official home of the Presto distributed SQL query engine for big data

    Java 16.1k 5.4k

  6. flink Public

    Forked from apache/flink

    Apache Flink

    Java