COMPARISON BETWEEN CNN AND RANDOM FOREST PERFORMANCE IN DETECTING HOAX INDONESIAN NEWS ARTICLES

PRAMUDYA, FRANCISKA NUGRAHAENI SIWI (2024) COMPARISON BETWEEN CNN AND RANDOM FOREST PERFORMANCE IN DETECTING HOAX INDONESIAN NEWS ARTICLES. Skripsi thesis, UNIVERSITAS KATOLIK SOEGIJAPRANATA.

Abstract

Hoax news is a serious problem today. Many people are easily swayed by opinions spread by certain parties without verifying them or looking for the facts. To address this, many researchers have built hoax news detectors using various algorithms. Some studies report that Random Forest performs better on this task, while others report that the Convolutional Neural Network (CNN) performs on par with Random Forest. A further common problem is prediction error caused by unsuitable preprocessing. This research therefore tests several preprocessing scenarios to find the appropriate preprocessing method for each of the CNN and Random Forest algorithms, and compares the performance of the two algorithms on a dataset of 4,000 factual news articles from Kompas.com and 4,000 hoax news articles from the turnback.hoax site. The results show that Random Forest achieves an average model accuracy of 90%, while CNN achieves an average model accuracy of 60%, with both using the same feature extraction method: TF-IDF combined with n-grams of size one (unigrams).
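
As an illustration of the feature extraction step named in the abstract, the following is a minimal sketch of TF-IDF weighting over unigrams in plain Python. It is not the thesis's implementation, which is not shown in this record. The function names `tokenize` and `tfidf_unigram`, the whitespace tokenizer, and the toy corpus are all assumptions made for the example, and the weighting uses the textbook form (term frequency times log(N / document frequency)); libraries such as scikit-learn apply smoothed variants that give different numbers.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace to obtain unigrams (single-word tokens)."""
    return text.lower().split()


def tfidf_unigram(docs):
    """Return (vocabulary, matrix) of textbook TF-IDF weights for a list of documents."""
    tokenized = [tokenize(d) for d in docs]
    vocab = sorted({tok for doc in tokenized for tok in doc})
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term at least once.
    df = Counter(tok for doc in tokenized for tok in set(doc))
    # Inverse document frequency: terms found in fewer documents get larger weights.
    idf = {tok: math.log(n_docs / df[tok]) for tok in vocab}
    matrix = []
    for doc in tokenized:
        counts = Counter(doc)
        total = len(doc) or 1  # avoid division by zero for an empty document
        matrix.append([counts[tok] / total * idf[tok] for tok in vocab])
    return vocab, matrix


# Hypothetical toy corpus; the thesis dataset itself is not reproduced here.
docs = [
    "pemerintah umumkan kebijakan baru",
    "kebijakan baru ini hoaks",
    "hoaks vaksin beredar",
]
vocab, matrix = tfidf_unigram(docs)
```

Each row of `matrix` is a fixed-length feature vector for one article, which is the kind of input a classifier such as Random Forest can be trained on.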

Item Type: Thesis (Skripsi)
Subjects: 000 Computer Science, Information and General Works > 004 Data processing & computer science
Divisions: Faculty of Computer Science > Department of Informatics Engineering
Depositing User: Mr Yosua Norman Rumondor
Date Deposited: 05 May 2024 12:11
Last Modified: 05 May 2024 12:11
URI: http://repository.unika.ac.id/id/eprint/35286
