Online Social Support Detection in Spanish Social Media Texts
Moein Shahiki Tash, Luis Ramos, Zahra Ahani, Raul Monroy, Olga, kolesnikova, Hiram Calvo, Grigori Sidorov

TL;DR
This paper presents a new dataset and methods for detecting online social support in Spanish social media comments, showing that dataset balancing improves classification performance and highlighting GPT-4o's effectiveness.
Contribution
Introduces the first annotated dataset for Spanish social support detection and compares traditional, deep learning, and transformer models, emphasizing the impact of dataset balancing.
Findings
Balanced dataset improves classification accuracy for most tasks.
GPT-4o performs best on support detection in unbalanced data.
Transformer models outperform traditional machine learning methods.
Abstract
The advent of social media has transformed communication, enabling individuals to share their experiences, seek support, and participate in diverse discussions. While extensive research has focused on identifying harmful content like hate speech, the recognition and promotion of positive and supportive interactions remain largely unexplored. This study proposes an innovative approach to detecting online social support in Spanish-language social media texts. We introduce the first annotated dataset specifically created for this task, comprising 3,189 YouTube comments classified as supportive or non-supportive. To address data imbalance, we employed GPT-4o to generate paraphrased comments and create a balanced dataset. We then evaluated social support classification using traditional machine learning models, deep learning architectures, and transformer-based models, including GPT-4o, but…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Communication and Language · Misinformation and Its Impacts · Hate Speech and Cyberbullying Detection
