iLCM - A Virtual Research Infrastructure for Large-Scale Qualitative Data
Andreas Niekler, Arnim Bleier, Christian Kahmann, Lisa Posch, Gregor, Wiedemann, Kenan Erdogan, Gerhard Heyer, Markus Strohmaier

TL;DR
The paper introduces iLCM, a SaaS-based research environment integrating text mining and reproducibility tools for large-scale qualitative data analysis in social sciences and digital humanities.
Contribution
It presents a novel integrated platform combining high-performance text mining with executable notebooks for reproducible research in social sciences.
Findings
Enables analysis of large qualitative datasets efficiently.
Supports reproducibility through integrated scripting environment.
Combines text mining with customizable research workflows.
Abstract
The iLCM project pursues the development of an integrated research environment for the analysis of structured and unstructured data in a "Software as a Service" architecture (SaaS). The research environment addresses requirements for the quantitative evaluation of large amounts of qualitative data with text mining methods as well as requirements for the reproducibility of data-driven research designs in the social sciences. For this, the iLCM research environment comprises two central components. First, the Leipzig Corpus Miner (LCM), a decentralized SaaS application for the analysis of large amounts of news texts developed in a previous Digital Humanities project. Second, the text mining tools implemented in the LCM are extended by an "Open Research Computing" (ORC) environment for executable script documents, so-called "notebooks". This novel integration allows to combine generic,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Data Analysis with R
