COMPARISON OF RANDOM FOREST ALGORITHM ACCURACY WITH XGBOOST USING HYPERPARAMETERS

STEFANUS, KEVIN (2023) COMPARISON OF RANDOM FOREST ALGORITHM ACCURACY WITH XGBOOST USING HYPERPARAMETERS. Other thesis, UNIVERSITAS KHATOLIK SOEGIJAPRANATA.

[img] Text
19.K1.0009-KEVIN STEFANUS-COVER_a.pdf

Download (537kB)
[img] Text
19.K1.0009-KEVIN STEFANUS-BAB I_a.pdf
Restricted to Registered users only

Download (86kB)
[img] Text
19.K1.0009-KEVIN STEFANUS-BAB II_a.pdf
Restricted to Registered users only

Download (97kB)
[img] Text
19.K1.0009-KEVIN STEFANUS-BAB III_a.pdf
Restricted to Registered users only

Download (92kB)
[img] Text
19.K1.0009-KEVIN STEFANUS-BAB IV_a.pdf
Restricted to Registered users only

Download (297kB)
[img] Text
19.K1.0009-KEVIN STEFANUS-BAB V_a.pdf
Restricted to Registered users only

Download (238kB)
[img] Text
19.K1.0009-KEVIN STEFANUS-BAB VI_a.pdf
Restricted to Registered users only

Download (82kB)
[img] Text
19.K1.0009-KEVIN STEFANUS-DAPUS_a.pdf

Download (150kB)
[img] Text
19.K1.0009-KEVIN STEFANUS-LAMP_a.pdf
Restricted to Registered users only

Download (261kB)

Abstract

Diabetes is one of the most dangerous diseases in the world and many people do not realize that they have diabetes in them. So many factors affect the occurrence of diabetes such as pregnancies, glucose, blood pressure, skinthickness, insulin, BMI, diabetes pedigree function, and age. so diabetes threatens silently and will appear suddenly. Therefore, this study will make a diabetes prediction using Random Forest and XGBoost algorithms. The model will be evaluated with accuracy, F1-Score, recall ,and precision. for randomization or random state will use random state 0 and 45. The results obtained from the comparison of these two algorithms are the highest accuracy of the random forest algorithm has a value of 88,98% while the highest accuracy of XGBoost gets an accuracy value of 87,00% at random state 45 and data division 90/10, while at random state 0 random forest has the highest accuracy value also with a value of 78,43% with data division 90/10 while XGBoost gets the highest accuracy value of 76,47% at data division 90/10. It can be concluded that random forest is better at predicting diabetes data than the XGBoost algorithm.

Item Type: Thesis (Other)
Subjects: 000 Computer Science, Information and General Works > 004 Data processing & computer science
Divisions: Faculty of Computer Science > Department of Informatics Engineering
Depositing User: Mr Yosua Norman Rumondor
Date Deposited: 05 Oct 2023 06:21
Last Modified: 05 Oct 2023 06:21
URI: http://repository.unika.ac.id/id/eprint/32961

Actions (login required)

View Item View Item