Indonesia's Fake News Detection using Transformer Network

Aisyah Awalina; Jibran Fawaid; Rifky Yunus Krisnabayu; Novanto; Yudistira

arXiv:2107.06796·cs.CL·November 5, 2021

Indonesia's Fake News Detection using Transformer Network

Aisyah Awalina, Jibran Fawaid, Rifky Yunus Krisnabayu, Novanto, Yudistira

PDF

Open Access 1 Repo

TL;DR

This paper explores fake news detection in Indonesia using various machine learning models, finding that BERT with Transformer Network achieves the highest accuracy of 90% on a dataset of Indonesian news.

Contribution

It introduces a novel application of BERT with Transformer Network for fake news detection in Bahasa Indonesia, filling a gap in existing literature.

Findings

01

BERT with Transformer Network outperforms other models.

02

Achieved 90% accuracy in fake news detection.

03

Dataset from turnbackhoax.id used for training and testing.

Abstract

Fake news is a problem faced by society in this era. It is not rare for fake news to cause provocation and problem for the people. Indonesia, as a country with the 4th largest population, has a problem in dealing with fake news. More than 30% of rural and urban population are deceived by this fake news problem. As we have been studying, there is only few literatures on preventing the spread of fake news in Bahasa Indonesia. So, this research is conducted to prevent these problems. The dataset used in this research was obtained from a news portal that identifies fake news, turnbackhoax.id. Using Web Scrapping on this page, we got 1116 data consisting of valid news and fake news. The dataset can be accessed at https://github.com/JibranFawaid/turnbackhoax-dataset. This dataset will be combined with other available datasets. The methods used are CNN, BiLSTM, Hybrid CNN-BiLSTM, and BERT with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

JibranFawaid/turnbackhoax-dataset
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Advanced Malware Detection Techniques · Spam and Phishing Detection

MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Tanh Activation · Sigmoid Activation · Multi-Head Attention · Attention Dropout · Byte Pair Encoding · Weight Decay