ViCGCN: Graph Convolutional Network with Contextualized Language Models for Social Media Mining in Vietnamese
Chau-Thang Phan, Quoc-Nam Nguyen, Chi-Thanh Dang, Trong-Hop Do, Kiet, Van Nguyen

TL;DR
This paper introduces ViCGCN, a novel model combining PhoBERT and Graph Convolutional Networks to improve Vietnamese social media text classification by addressing data imbalance and noise, achieving state-of-the-art results.
Contribution
The study presents a new approach that integrates contextualized language models with GCNs specifically for Vietnamese social media mining, outperforming existing models.
Findings
ViCGCN outperforms 13 baseline models on three datasets.
Significant accuracy improvements of up to 6.21%.
Applying GCN to BERT models enhances performance.
Abstract
Social media processing is a fundamental task in natural language processing with numerous applications. As Vietnamese social media and information science have grown rapidly, the necessity of information-based mining on Vietnamese social media has become crucial. However, state-of-the-art research faces several significant drawbacks, including imbalanced data and noisy data on social media platforms. Imbalanced and noisy are two essential issues that need to be addressed in Vietnamese social media texts. Graph Convolutional Networks can address the problems of imbalanced and noisy data in text classification on social media by taking advantage of the graph structure of the data. This study presents a novel approach based on contextualized language model (PhoBERT) and graph-based method (Graph Convolutional Networks). In particular, the proposed approach, ViCGCN, jointly trained the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Text and Document Classification Technologies · Topic Modeling
MethodsGraph Convolutional Network
