Measurement for Opaque Systems: Multi-source Triangulation with Interpretable Machine Learning
Margaret Foster

TL;DR
This paper introduces a measurement framework for inaccessible systems using multi-source data triangulation and interpretable machine learning to derive meaningful insights despite data limitations.
Contribution
It presents a novel approach combining multi-source triangulation with interpretable ML models for measurement in data-scarce, high-stakes contexts.
Findings
Successfully applied to clandestine organization dynamics
Recovered meaningful variation from incomplete signals
Provided insights despite data fragmentation and bias
Abstract
We propose a measurement framework for difficult-to-access contexts that uses indirect data traces, interpretable machine-learning models, and theory-guided triangulation to fill inaccessible measurement spaces. Many high-stakes systems of scientific and policy interest are difficult, if not impossible, to reach directly: dynamics of interest are unobservable, data are indirect and fragmented across sources, and ground truth is absent or concealed. In these settings, available data often do not support conventional strategies for analysis, such as statistical inference on a single authoritative data stream or model validation against labeled outcomes. To address this problem, we introduce a general framework for measurement in data regimes characterized by structurally missing or adversarial data. We propose combining multi-source triangulation with interpretable machine learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Explainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI
