A dataset of subjectivity classification in Indonesian ride-hailing app reviews
Violeta Arifin, Yuriashi Adelia Putri, Richard Wiputra

TL;DR
This paper introduces a dataset of 1338 Indonesian ride-hailing app reviews labeled for subjectivity, aiding in the analysis of user feedback in low-resource languages.
Contribution
The novel contribution is a high-quality, annotated Indonesian subjectivity dataset for ride-hailing app reviews with consensus-based labeling.
Findings
The dataset includes 1338 Indonesian ride-hailing app reviews annotated for subjectivity.
The dataset is suitable for benchmarking classifiers and analyzing cross-domain generalization.
The structured annotation design supports reproducible analysis across modeling approaches.
Abstract
As more people share their experiences online, understanding whether their reviews are subjective or objective has become key to evaluating how services are perceived, especially in the Indonesian ride-hailing industry. This article presents a subjectivity dataset of 1338 Indonesian-language ride-hailing app reviews collected from the Google Play Store. To enhance the quality and consistency of the data for analysis, all reviews were preprocessed to eliminate elements such as URLs, emojis, and extraneous characters. Two independent annotators manually annotate the dataset, followed by a consensus-based adjudication process to produce a high-quality classification. The annotation supports robust evaluation of subjectivity detection models and contributes toward developing more nuanced natural language understanding systems in low-resource languages. The data can be reused for multiple…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTransportation and Mobility Innovations · AI in Service Interactions · Persona Design and Applications
