- anomaly detection system to detect outliers in server nodes using Gaussian model
- implemented on two sets of data
- dataset with two features: Latency and Throughput
- higher dimensional dataset with a set of 11 features
- jupyter with python 3.10.6 64-bit kernel
- python libraries: pandas, matplotlib, numpy, pylab
- 2-dimensional dataset
data_train.csv
: training set for the 2-D datadata_val.csv
: validation set includingX_val
andy_val
; separated after loading dataset
- 11-dimensional dataset
data2_X.csv
: training set with 11 featuresdata2_Xval.csv
: validation set with 11 featuresdata2_yval.csv
: validation labels to assess the model trained on the higher dimensional dataset
- performance of this unsupervised Gaussian model is determined using the F1-score for every iteration till it saturates to its optimum
- the model is trained over the training set to learn the parameters
mu
andsigmasq
denoting the dataset's mean and variance - validation set is fed to the model using the learnt
mu
andsigmasq
to learn the new parameterepsilon
by maximizing the F1 score - using the learnt parameter
epsilon
outliers are detected in the dataset
- 2-dimensional dataset
- optimum epsilon expected:
8.99e-05
- best F1 score:
0.875000
- 11-dimensional dataset
- optimum epsilon expected:
1.38e-18
- best F1 score:
0.615385
🗣 DIVERSION 2k23 encourages participation in the open source community, which grows bigger every year. Complete the challenge and win amazing rewards.
📢 Register here for DIVERSION 2k23 and start contributing to the projects.
[Add Feature list]
- Pull requests can be submitted to any opted-in repository on GitHub or GitLab.
- The pull request must contain commits you made yourself.
- If a maintainer reports your pull request as spam, it will not be counted toward your participation in Hacktoberfest.
- If a maintainer reports behavior that’s not in line with the project’s code of conduct, you will be ineligible to participate.
- To get a shirt, you must make four approved pull requests (PRs) on opted-in projects between October 1-31 in any time zone.
- This year, the first 40,000 participants can earn a T-shirt.
Do Read - Code of Conduct
Jump to - DIVERSION 2k23 guidelines (updated)
Why not start your open source journey by merging some codes here. It will be very great if you review the code and find some bugs and issues.
- Choose any of the Ways to contribute for contribution.
- Create a PR and commit by creating a new branch (:warning: Avoid commiting to the main)
- Be patience it might take time for the maintainers to review your PR.
- Congratulations on a successful merge.
- You can add your name to contributors list.
- Find a bug by running the project locally
- Solve the issues which are already set up
- Sugget some change or create your own isssue
- Create documentation like site structure, how to write a blog,etc
- Indent/Beautify/Modify codes.
Thank You! Keep Contributing