Performance Evaluation of Machine Learning Classification Techniques for Diabetes Disease
Gahizi Emmanuel,Gilbert Gutabaga Hungilo,Andi W.R. Emanuel

University Atma Jaya Yogyakarta
Yogyakarta, Indonesia
University Atma Jaya Yogyakarta
Yogyakarta, Indonesia
University Atma Jaya Yogyakarta
Yogyakarta, Indonesia


Abstract

Diabetes is noncontagious disease where diabetes of type two mellitus is among top five leading the cause of global death. Not knowing the status of patient leads to complication such as kidney neuropathy and retinopathy, eventually lead to death. To know the patient’s stand using machine learning techniques can assist in early treatment will be effective in lowering the above mentioned burdens caused by diabetes. In this work researchers focused on evaluating the patient’s status of diabetes. In this study Cross Industry Standard Process for Data mining (CRISP-DM) used as research methodology of research. Where Support vector machine, Decision Tree, Naive Bayes used as classification technique .the study aim to predict the patient status for optimizing the complication caused by diabetes. The data set used for the model was retrieved from the Pima Indian diabetic database Diabetes Database (PIDD) which is obtained from UCI machine learning database with 768 records in total. KNN algorithm can be made best with an accuracy of 76% for the condensed dataset with the nine attributes as identified from the comparison of result of different models.

Keywords: diabetes; predicting; machine learning; classification algorithms, Cross Industry Standard Process for Data mining (CRISP-DM)

Topic: Information Engineering

AASEC 2020 Conference | Conference Management System