Analysis of composition on the original scale of measurement
David Firth, Fiona Sammut

TL;DR
This paper proposes a direct statistical modeling approach for compositional data on the original measurement scale, offering advantages over traditional log-ratio methods, especially in handling zeros and interpretability.
Contribution
It introduces a variance-covariance function based on multiplicative errors and advocates for quasi-likelihood analysis as a robust alternative to log-ratio transformations.
Findings
Robustness to zero or near-zero measurements
More direct interpretation of compositional data
Applicability of quasi-likelihood methods to composition analysis
Abstract
In current applied research the most-used route to an analysis of composition is through log-ratios -- that is, contrasts among log-transformed measurements. Here we argue instead for a more direct approach, using a statistical model for the arithmetic mean on the original scale of measurement. Central to the approach is a general variance-covariance function, derived by assuming multiplicative measurement error. Quasi-likelihood analysis of logit models for composition is then a general alternative to the use of multivariate linear models for log-ratio transformed measurements, and it has important advantages. These include robustness to secondary aspects of model specification, stability when there are zero-valued or near-zero measurements in the data, and more direct interpretation. The usual efficiency property of quasi-likelihood estimation applies even when the error covariance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Criteria Decision Making · Bayesian Modeling and Causal Inference
