FeClustRE: Hierarchical Clustering and Semantic Tagging of App Features from User Reviews
Max Tiessler, Quim Motger

TL;DR
FeClustRE is a hybrid framework that extracts, clusters, and semantically labels app features from user reviews, providing structured insights to improve requirements analysis and app comparison.
Contribution
It introduces a novel hybrid approach combining syntactic parsing and LLMs for feature extraction, hierarchical clustering with auto-tuning, and automatic taxonomy labeling, all open-source.
Findings
High extraction accuracy on benchmark datasets
Effective semantic clustering and labeling of app features
Enhanced interpretability and organization of user feedback
Abstract
[Context and motivation.] Extracting features from mobile app reviews is increasingly important for multiple requirements engineering (RE) tasks. However, existing methods struggle to turn noisy, ambiguous feedback into interpretable insights. [Question/problem.] Syntactic approaches lack semantic depth, while large language models (LLMs) often miss fine-grained features or fail to structure them coherently. In addition, existing methods output flat lists of features without semantic organization, limiting interpretation and comparability. Consequently, current feature extraction approaches do not provide structured, meaningful representations of app features. As a result, practitioners face fragmented information that hinder requirement analysis, prioritization, and cross-app comparison, among other use cases. [Principal ideas/results.] In this context, we propose FeClustRE, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Techniques and Practices · Persona Design and Applications · Advanced Malware Detection Techniques
