THE EFFECT OF CHI-SQUARE FEATURE SELECTION ON THE NAIVE BAYES ALGORITHM IN ANALYZING THE SENTIMENT OF GOJEK APPLICATION REVIEWS ON GOOGLE PLAY STORE

DWINANTA, RAFAEL HANDIKA (2024) THE EFFECT OF CHI-SQUARE FEATURE SELECTION ON THE NAIVE BAYES ALGORITHM IN ANALYZING THE SENTIMENT OF GOJEK APPLICATION REVIEWS ON GOOGLE PLAY STORE. S1 thesis, UNIVERSITAS KATOLIK SOEGIJAPRANATA.


Abstract

This study analyzes user sentiment in reviews of the Gojek application to determine whether Chi-Square feature selection improves the performance of a sentiment analysis model. The dataset consists of 12,000 Gojek reviews, each labeled positive, negative, or neutral according to the rating the user gave. Naive Bayes is evaluated with and without Chi-Square feature selection in terms of accuracy, precision, recall, and F1 score. The best performance is obtained with alpha 0.5 combined with the top 2,000 Chi-Square features, which yields 86.96% accuracy, 87.84% precision, 86.96% recall, and an 85.29% F1 score on the imbalanced data. SMOTE is also applied to compensate for the small number of neutral reviews, but it results in lower accuracy. In conclusion, Chi-Square feature selection improves the accuracy of the Naive Bayes model on both the imbalanced and the balanced datasets.
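The pipeline summarized in the abstract (rating-based labels, Chi-Square feature selection, Naive Bayes with a smoothing parameter) can be illustrated with a minimal pure-Python sketch. It is not the thesis code. The rating thresholds, the toy English reviews, the function names, the use of the largest per-class 2x2 Chi-Square statistic as a term's score, and the reading of "alpha" as the additive smoothing constant of multinomial Naive Bayes are all assumptions made for illustration. The toy example keeps 7 terms where the study keeps 2,000.

```python
import math
from collections import Counter, defaultdict

def label_from_rating(stars):
    # Assumed thresholds; the abstract does not state the actual mapping.
    if stars >= 4:
        return "positive"
    if stars <= 2:
        return "negative"
    return "neutral"

def chi2_scores(docs, labels):
    """Score each term by its largest 2x2 chi-square statistic over all classes."""
    n = len(docs)
    class_count = Counter(labels)
    doc_freq = Counter()            # number of documents containing the term
    joint = defaultdict(Counter)    # joint[term][cls]: documents of cls containing the term
    for tokens, y in zip(docs, labels):
        for t in set(tokens):
            doc_freq[t] += 1
            joint[t][y] += 1
    scores = {}
    for t, df in doc_freq.items():
        best = 0.0
        for c, nc in class_count.items():
            n11 = joint[t][c]            # term present, class c
            n10 = df - n11               # term present, other classes
            n01 = nc - n11               # term absent, class c
            n00 = n - n11 - n10 - n01    # term absent, other classes
            denom = (n11 + n01) * (n11 + n10) * (n10 + n00) * (n01 + n00)
            if denom:
                best = max(best, n * (n11 * n00 - n10 * n01) ** 2 / denom)
        scores[t] = best
    return scores

def train_nb(docs, labels, vocab, alpha=0.5):
    """Multinomial Naive Bayes restricted to vocab, with additive smoothing alpha."""
    class_count = Counter(labels)
    term_count = {c: Counter() for c in class_count}
    for tokens, y in zip(docs, labels):
        term_count[y].update(t for t in tokens if t in vocab)
    model = {}
    for c, nc in class_count.items():
        total = sum(term_count[c].values()) + alpha * len(vocab)
        log_lik = {t: math.log((term_count[c][t] + alpha) / total) for t in vocab}
        model[c] = (math.log(nc / len(labels)), log_lik)
    return model

def predict(model, tokens):
    def score(c):
        log_prior, log_lik = model[c]
        # Terms outside the selected vocabulary are ignored.
        return log_prior + sum(log_lik[t] for t in tokens if t in log_lik)
    return max(model, key=score)

# Made-up toy reviews (text, star rating); not data from the study.
reviews = [
    ("driver friendly fast good", 5),
    ("app good great", 5),
    ("service fast great", 4),
    ("app error slow", 1),
    ("driver slow disappointed", 1),
    ("error again disappointed", 2),
    ("just average", 3),
    ("okay average", 3),
]
docs = [text.split() for text, _ in reviews]
labels = [label_from_rating(stars) for _, stars in reviews]

K = 7  # the study reports its best result with the top 2,000 features
scores = chi2_scores(docs, labels)
vocab = set(sorted(scores, key=scores.get, reverse=True)[:K])
model = train_nb(docs, labels, vocab, alpha=0.5)

print(sorted(vocab))
print(predict(model, "fast good app".split()))
```

In this toy run, terms shared across classes such as "driver" and "app" receive low scores and are dropped, which is the effect feature selection is meant to have. With scikit-learn, the same idea is commonly written with `SelectKBest(chi2, k=2000)` followed by `MultinomialNB(alpha=0.5)`, although that library computes the Chi-Square statistic from term counts, not from document presence, and the abstract does not say which implementation the thesis used.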

Item Type: Thesis (S1)
Subjects: 000 Computer Science, Information and General Works > 004 Data processing & computer science
Divisions: Faculty of Computer Science > Department of Informatics Engineering
Depositing User: Ms. Wiwien Vieragustin
Date Deposited: 11 Jul 2025 01:08
Last Modified: 11 Jul 2025 01:08
URI: http://repository.unika.ac.id/id/eprint/37271
Keywords: UNSPECIFIED
