Deep Autoencoders based Anomaly Detection for Transactional data

Project Overview

Objective

This project focuses on the development of deep learning models based on autoencoders for the purpose of anomaly detection. Autoencoders are neural networks used to learn compressed representations of raw data, making them effective tools for detecting anomalies in datasets. The project also involves deploying the trained model as an API using Flask.

Aim

The primary objectives of this project include:

For normal transactions developing a deep learning model based on autoencoders for anomaly detection.
Deploying the model as an API using Flask for real-time anomaly detection.

Data Overview

The dataset used in this project is a transaction dataset containing information on more than 100,000 transactions, each characterized by several features. This data serves as the foundation for training and testing the deep autoencoder model.

Tech Stack

Language: Python
Packages: Pandas, Numpy, Matplotlib, Keras, Tensorflow
API Service: Flask, Gunicorn

Approach

The project follows a structured approach:

Understand the business objective and the importance of anomaly detection.
Perform exploratory data analysis (EDA) to gain insights into the dataset.
Normalize and clean the data, addressing any missing values through imputation.
Delve into the theory behind autoencoders and their architecture.
Build a base autoencoder model using the Keras library.
Fine-tune the model to extract the best performance for anomaly detection.
Make predictions using the trained model to identify anomalies.
Serve the model as an API endpoint using Flask, enabling real-time anomaly detection.

Modular Code

input: Contains the dataset files used for analysis (e.g., final_cred_data.csv, Test-data.csv).
src: The heart of the project, this folder contains modularized code for various steps, including data preprocessing, model building, and deployment. It consists of the ML_pipeline and engine.py files, each containing functions for different functionalities.
output: Contains pre-trained models saved as .pkl files. These models can be conveniently loaded and used without the need for retraining.
lib: A reference folder with the original IPython notebook.
requirements.txt: Lists all required libraries and their versions for easy installation using pip.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

input

input

lib

lib

output

output

src

src

LICENSE

LICENSE

readme.md

readme.md

requirements.txt

requirements.txt

Repository files navigation

Deep Autoencoders based Anomaly Detection for Transactional data

Project Overview

Objective

Aim

Data Overview

Tech Stack

Approach

Modular Code

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
input		input
lib		lib
output		output
src		src
LICENSE		LICENSE
readme.md		readme.md
requirements.txt		requirements.txt

License

AjNavneet/Transactions_AnomalyDetection_DeepAutoencoder_Flask

Folders and files

Latest commit

History

Repository files navigation

Deep Autoencoders based Anomaly Detection for Transactional data

Project Overview

Objective

Aim

Data Overview

Tech Stack

Approach

Modular Code

About

Topics

Resources

License

Stars

Watchers

Forks

Languages