Skip to content

UWA CITS3401 Data Warehousing Project Semester 1 2019

Notifications You must be signed in to change notification settings

brucehow/cits3401-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CITS3401 Project · Python 3.7.2 MSSQL

Authors

Overview

This project is written for CITS3401 Data Warehousing, Semester 1 2019. It is a two part project, Data Warehousing (40 marks) and Data Mining (20 marks), with 2 bonus marks in Data Mining.

Kaggle's PetFinder Adoption Prediction data set is used for the project. ETL is performed using Python, and concept heirarchies are generated using draw.io.

Execute ETL and Association

$ cd submission/etl
$ python3 etl.py
$ cd submission/associations
$ python3 build_associations.py

Dependencies

  • Microsoft SQL Server
  • Microsoft Visual Studio SSDT
  • Microsoft SQL Server Management Studio
  • Python 3
  • Weka
  • PowerBI
  • draw.io

Discretion

These files should be used as a guideline to understanding the project. Copying the code in any way risks Academic Misconduct at the University of Western Australia. Discretion is advised.