According to WHO about 422 million people worldwide have diabetes. Since diabetes affects a large population across the globe and the collection of these datasets is a continuous process and it comprises of various patient related attributes such as age, gender, symptoms, insulin levels, blood pressure, blood glucose levels, weight etc. We are working on Pima Indians Diabetes Dataset (PIDD), extracted from the University of California, Irvine (UCI) machine learning repository.
PIDD consists of several medical parameters and one dependent (outcome) parameter of binary values .This dataset is mainly for female gender and Description of dataset is as following 9 columns with 8 independent parameter and one outcome parameter with uniquely identified 768 observations having 268 positive for diabetes (1) and 500 negative for diabetes (0)
- Pregnancies : Number of times pregnant
- Glucose: Oral Glucose Tolerance Test result
- BloodPressure: Diastolic Blood Pressure values in (mm Hg)
- SkinThickness: Triceps skin fold thickness in (mm)
- Insulin: 2-Hour serum insulin (mu U/ml)
- BMI: Body mass index
- DiabetesPedigreeFunction: Diabetes pedigree function
- Age: Age in years
- Outcome: Class 1 indicates person having diabetes and 0 indicates other.
https://www.kaggle.com/uciml/pima-indians-diabetes-database
Predict whether or not a person from the Pima tribe is likely to have Diabetes
- Kneighborsclassifier
- Logisticregression
- Decisiontreeclassifier
- Svc-svm xgbclassifier
- Randomforestclassifier