LASTIST: LArge-Scale Target-Independent STance dataset

DongJae Kim; Yaejin Lee; Minsu Park; Eunil Park

arXiv:2510.25783·cs.CL·October 31, 2025

LASTIST: LArge-Scale Target-Independent STance dataset

DongJae Kim, Yaejin Lee, Minsu Park, Eunil Park

PDF

TL;DR

This paper introduces LASTIST, a large-scale Korean stance detection dataset, enabling research in target-independent stance detection and addressing low-resource language challenges.

Contribution

The paper presents the creation of the first large-scale Korean stance dataset, LASTIST, and demonstrates its utility for various stance detection tasks.

Findings

01

Successfully collected 563,299 labeled Korean sentences.

02

Trained state-of-the-art deep learning models on the dataset.

03

Enabled target-independent and diachronic stance detection in Korean.

Abstract

Stance detection has emerged as an area of research in the field of artificial intelligence. However, most research is currently centered on the target-dependent stance detection task, which is based on a person's stance in favor of or against a specific target. Furthermore, most benchmark datasets are based on English, making it difficult to develop models in low-resource languages such as Korean, especially for an emerging field such as stance detection. This study proposes the LArge-Scale Target-Independent STance (LASTIST) dataset to fill this research gap. Collected from the press releases of both parties on Korean political parties, the LASTIST dataset uses 563,299 labeled Korean sentences. We provide a detailed description of how we collected and constructed the dataset and trained state-of-the-art deep learning and stance detection models. Our LASTIST dataset is designed for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.