NLPBK at VLSP-2020 shared task: Compose transformer pretrained models   for Reliable Intelligence Identification on Social network

Thanh Chinh Nguyen; Van Nha Nguyen

arXiv:2101.12672·cs.CL·February 1, 2021·1 cites

NLPBK at VLSP-2020 shared task: Compose transformer pretrained models for Reliable Intelligence Identification on Social network

Thanh Chinh Nguyen, Van Nha Nguyen

PDF

Open Access

TL;DR

This paper presents a transformer-based model combined with metadata features for reliable intelligence detection on Vietnamese social media, achieving top-tier ROC-AUC scores.

Contribution

It introduces a novel approach that integrates pretrained transformer models with metadata features for social media intelligence identification.

Findings

01

Achieved 0.9392 ROC-AUC on public test set

02

Final ROC-AUC of 0.9513 on private test set

03

Model outperforms baseline methods in the shared task

Abstract

This paper describes our method for tuning a transformer-based pretrained model, to adaptation with Reliable Intelligence Identification on Vietnamese SNSs problem. We also proposed a model that combines bert-base pretrained models with some metadata features, such as the number of comments, number of likes, images of SNS documents,... to improved results for VLSP shared task: Reliable Intelligence Identification on Vietnamese SNSs. With appropriate training techniques, our model is able to achieve 0.9392 ROC-AUC on public test set and the final version settles at top 2 ROC-AUC (0.9513) on private test set.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Misinformation and Its Impacts · Seismology and Earthquake Studies