COMPARISON BETWEEN RANDOM FOREST AND XGBOOST PERFORMANCE IN TEXT CLASSIFICATION FOR EMOTION DETECTION

LUKITO, JESSICA ANGELA (2025) COMPARISON BETWEEN RANDOM FOREST AND XGBOOST PERFORMANCE IN TEXT CLASSIFICATION FOR EMOTION DETECTION. S1 thesis, UNIVERSITAS KATOLIK SOEGIJAPRANATA.

21.K1.0029_JESSICA ANGELA LUKITO_COVER.pdf (436kB)
21.K1.0029_JESSICA ANGELA LUKITO_BAB I.pdf (483kB), restricted to registered users only
21.K1.0029_JESSICA ANGELA LUKITO_BAB II.pdf (476kB), restricted to registered users only
21.K1.0029_JESSICA ANGELA LUKITO_BAB III.pdf (756kB), restricted to registered users only
21.K1.0029_JESSICA ANGELA LUKITO_BAB IV.pdf (705kB), restricted to registered users only
21.K1.0029_JESSICA ANGELA LUKITO_BAB V.pdf (406kB), restricted to registered users only
21.K1.0029_JESSICA ANGELA LUKITO_DAPUS.pdf (473kB)
21.K1.0029_JESSICA ANGELA LUKITO_LAMP.pdf (736kB), restricted to registered users only

Abstract

Humans cannot read minds. Today, most people communicate through text on social media, without face-to-face interaction. Many misunderstandings arise in online conversations such as texting, because unclear messages lead to confusion. These misunderstandings can have negative consequences, such as fights and separations. Much research has been done to address this issue. Some researchers report that Random Forest is the best algorithm for text classification, while others report that XGBoost, a decision-tree-based algorithm, is the best. Moreover, one study reports that Decision Tree is the worst-performing algorithm for text classification. This study compares Random Forest and XGBoost, both decision-tree-based algorithms, under several pre-processing scenarios and methods. The dataset, obtained from the Kaggle website, contains 416,809 unique sentences.
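The abstract mentions "several pre-processing scenarios" without naming them. As an illustration only, the sketch below shows one way such scenarios could be enumerated before training either classifier. The three steps (lowercasing, punctuation stripping, stop-word removal), the tiny stop-word set, and every function name are assumptions made for this example and are not taken from the thesis.

```python
import itertools
import re

# Hypothetical stop-word list, purely illustrative (not from the thesis).
STOPWORDS = {"i", "am", "so", "the", "a", "to", "and"}


def lowercase(text):
    return text.lower()


def strip_punctuation(text):
    # Replace every character that is neither a word character nor whitespace.
    return re.sub(r"[^\w\s]", " ", text)


def remove_stopwords(text):
    return " ".join(w for w in text.split() if w.lower() not in STOPWORDS)


# Assumed pre-processing steps, applied in this fixed order when selected.
STEPS = [
    ("lowercase", lowercase),
    ("strip_punctuation", strip_punctuation),
    ("remove_stopwords", remove_stopwords),
]


def build_scenarios(steps):
    """Return every subset of the steps as a named scenario."""
    scenarios = {}
    for r in range(len(steps) + 1):
        for combo in itertools.combinations(steps, r):
            name = "+".join(n for n, _ in combo) or "raw"
            scenarios[name] = [f for _, f in combo]
    return scenarios


def apply_scenario(text, funcs):
    for f in funcs:
        text = f(text)
    # Collapse any runs of whitespace left behind by the steps.
    return " ".join(text.split())


SCENARIOS = build_scenarios(STEPS)

if __name__ == "__main__":
    sample = "I am SO happy today!!!"
    for name, funcs in SCENARIOS.items():
        print(f"{name} -> {apply_scenario(sample, funcs)}")
```

Each scenario's output could then be vectorised and passed to both classifiers, so that any difference in scores is attributable to either the model or the pre-processing, not both at once.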

Item Type: Thesis (S1)
Subjects: 000 Computer Science, Information and General Works
Divisions: Faculty of Computer Science > Department of Informatics Engineering
Depositing User: ms. Wiwien Vieragustin
Date Deposited: 11 Jul 2025 01:08
Last Modified: 11 Jul 2025 01:08
URI: http://repository.unika.ac.id/id/eprint/37275
Keywords: UNSPECIFIED
