Indonesia's Fake News Detection using Transformer Network
Aisyah Awalina, Jibran Fawaid, Rifky Yunus Krisnabayu, Novanto, Yudistira

TL;DR
This paper explores fake news detection in Indonesia using various machine learning models, finding that BERT with Transformer Network achieves the highest accuracy of 90% on a dataset of Indonesian news.
Contribution
It introduces a novel application of BERT with Transformer Network for fake news detection in Bahasa Indonesia, filling a gap in existing literature.
Findings
BERT with Transformer Network outperforms other models.
Achieved 90% accuracy in fake news detection.
Dataset from turnbackhoax.id used for training and testing.
Abstract
Fake news is a problem faced by society in this era. It is not rare for fake news to cause provocation and problem for the people. Indonesia, as a country with the 4th largest population, has a problem in dealing with fake news. More than 30% of rural and urban population are deceived by this fake news problem. As we have been studying, there is only few literatures on preventing the spread of fake news in Bahasa Indonesia. So, this research is conducted to prevent these problems. The dataset used in this research was obtained from a news portal that identifies fake news, turnbackhoax.id. Using Web Scrapping on this page, we got 1116 data consisting of valid news and fake news. The dataset can be accessed at https://github.com/JibranFawaid/turnbackhoax-dataset. This dataset will be combined with other available datasets. The methods used are CNN, BiLSTM, Hybrid CNN-BiLSTM, and BERT with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Advanced Malware Detection Techniques · Spam and Phishing Detection
MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Tanh Activation · Sigmoid Activation · Multi-Head Attention · Attention Dropout · Byte Pair Encoding · Weight Decay
