SENTIMENT ANALYSIS REVIEW COMMENT USING TRANSFORMER MODEL

KUSUMO, VALENTINO PUTRA BUDI (2024) SENTIMENT ANALYSIS REVIEW COMMENT USING TRANSFORMER MODEL. S1 thesis, UNIVERSITAS KATOLIK SOEGIJAPRANATA.


Abstract

This research investigates the application of transformer-based models to sentiment analysis of movie reviews, with an emphasis on the role of preprocessing techniques. Traditional machine learning approaches have shown limited capability in capturing the emotional complexity of textual data. The study explores preprocessing methods such as case folding, symbol cleaning, tokenization, stopword removal, and stemming to evaluate their impact on the performance of the T5-small model integrated with a multilayer perceptron (MLP). Using a Kaggle-sourced dataset, the research identifies the optimal preprocessing combinations to maximize accuracy, precision, recall, and F1-score. Experiments demonstrate that preprocessing enhances the model's generalization, reducing overfitting and improving performance metrics compared to unprocessed data. Grid search optimization identifies the best hyperparameters, achieving a peak validation accuracy of 70.56% with balanced precision and recall. The results underline the necessity of tailored preprocessing for effective sentiment analysis while highlighting gaps for future research, such as dataset diversity and advanced hyperparameter tuning.
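
The preprocessing steps named in the abstract can be illustrated with a minimal sketch. It is not the thesis's implementation: the function names, the tiny stopword list, and the suffix-stripping stemmer are all assumptions made for illustration, and a real experiment would more likely use an established stopword list and stemmer. Each step can be switched on or off, which is one way to compare preprocessing combinations.

```python
import re

# Hypothetical, deliberately tiny stopword list (assumption, not from the thesis).
STOPWORDS = {"the", "a", "an", "is", "was", "and", "of", "to", "it", "this"}


def case_fold(text):
    """Case folding: lowercase the whole review."""
    return text.lower()


def clean_symbols(text):
    """Symbol cleaning: replace every non-alphanumeric, non-space character with a space."""
    return re.sub(r"[^A-Za-z0-9\s]", " ", text)


def tokenize(text):
    """Tokenization: split on runs of whitespace."""
    return text.split()


def remove_stopwords(tokens):
    """Stopword removal: drop tokens found in STOPWORDS (case-sensitive match)."""
    return [t for t in tokens if t not in STOPWORDS]


def naive_stem(token):
    """Crude stand-in for a stemmer: strip one common suffix if at least 3 characters remain."""
    for suffix in ("ing", "ed", "ly", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text, fold=True, clean=True, stop=True, stem=True):
    """Apply the enabled steps in order and return the resulting token list."""
    if fold:
        text = case_fold(text)
    if clean:
        text = clean_symbols(text)
    tokens = tokenize(text)
    if stop:
        tokens = remove_stopwords(tokens)
    if stem:
        tokens = [naive_stem(t) for t in tokens]
    return tokens


if __name__ == "__main__":
    print(preprocess("This movie was AMAZING, and the acting... loved it!"))
```

Because the stopword match is case-sensitive, disabling case folding changes what stopword removal catches. Interactions of this kind are why the order and combination of steps matter.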

Item Type: Thesis (S1)
Subjects: 000 Computer Science, Information and General Works
000 Computer Science, Information and General Works > 004 Data processing & computer science
Divisions: Faculty of Computer Science > Department of Informatics Engineering
Depositing User: ms. Wiwien Vieragustin
Date Deposited: 10 Jul 2025 07:58
Last Modified: 10 Jul 2025 07:58
URI: http://repository.unika.ac.id/id/eprint/37153
Keywords: UNSPECIFIED
