Q-Sat AI: Machine Learning-Based Decision Support for Data Saturation in Qualitative Studies
Hasan Tutar, Caner Erden, \"Umit \c{S}ent\"urk

TL;DR
This paper presents a machine learning model that objectively determines data saturation in qualitative research, aiming to standardize sample size decisions and improve methodological rigor.
Contribution
It introduces a novel ensemble ML approach using key qualitative parameters to predict data saturation, enhancing transparency and consistency in qualitative sampling.
Findings
ML models achieved high explanatory power (Test R2 ~ 0.85).
Research design type and information power are key features.
Proposes a web-based decision support tool for qualitative researchers.
Abstract
The determination of sample size in qualitative research has traditionally relied on the subjective and often ambiguous principle of data saturation, which can lead to inconsistencies and threaten methodological rigor. This study introduces a new, systematic model based on machine learning (ML) to make this process more objective. Utilizing a dataset derived from five fundamental qualitative research approaches - namely, Case Study, Grounded Theory, Phenomenology, Narrative Research, and Ethnographic Research - we developed an ensemble learning model. Ten critical parameters, including research scope, information power, and researcher competence, were evaluated using an ordinal scale and used as input features. After thorough preprocessing and outlier removal, multiple ML algorithms were trained and compared. The K-Nearest Neighbors (KNN), Gradient Boosting (GB), Random Forest (RF),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQualitative Research Methods and Applications · Computational and Text Analysis Methods · Data Analysis and Archiving
