Skip to content

A project for the lesson of Knowledge Discovery from Databases. Normalizing values, ENN and IB2 algorithms are implemented. Tested with iris.csv dataset.

Notifications You must be signed in to change notification settings

ThanSam/Data-Editing-and-Reduction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Editing and Reduction

A project for the univeristy lesson of "Knowledge Discovery from Databases".

Using as training datasets csv files, comprised of numerical attributes only with the exception of the class attribute, are implemented in Python 3.9 the following operations:

  • NormalizeValues(inputCsvFile):  Normalizing the values of all attributes-except for the class attribute(i.e., transforms them in the [0,1] range).

  • ENN(inputNormalizedCsvFile, K):  Takes as input a normalized csv file and the required algorithm parameter K. Applies the editing algorithm ENN on it.

  • IB2(inputNormalizedCsvFile):  Takes as input a normalized csv file and applies the instance reduction algorithm IB2 on it.

The code is tested with the "iris.csv" and "letter-recognition.csv" datasets.
The output in each program is written to a csv file as well.

About

A project for the lesson of Knowledge Discovery from Databases. Normalizing values, ENN and IB2 algorithms are implemented. Tested with iris.csv dataset.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages