On the choice of weights in aggregate compositional data analysis

Vartan Choulakian; Jules De Tibeiro; Pasquale Sarnacchiaro

arXiv:2301.10970·stat.AP·January 27, 2023

Open Access

TL;DR

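This paper examines how to choose the row and column weights in log interaction analysis of aggregate compositional data. It proposes two analysis-visualization approaches and a first-order approximation of the log interaction, both of which follow from distinguishing elementary from aggregate data.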

Contribution

It introduces a distinction between elementary and aggregate compositional data, and proposes two novel approaches for analyzing and visualizing aggregate compositional vectors.

Findings

01 Different weight choices affect log interaction analysis results.

02 Proposed two approaches: log interaction of aggregates and aggregate of log interactions.

03 First-order approximation of log interaction varies with row and column weights.

Abstract

In this paper, we distinguish between two kinds of compositional data sets: elementary and aggregate. This distinction helps us choose the weights to use in log interaction analysis of aggregate compositional vectors. We show that in the aggregate case, the underlying data form a paired data set composed of responses and qualitative covariates; this fact lets us propose two approaches for analysis and visualization of the data, named log interaction of aggregates and aggregate of log interactions. Furthermore, we show the first-order approximation of the log interaction of a cell for different choices of the row and column weights.
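To make the role of the weights concrete, here is a minimal sketch of a weighted log interaction. It assumes the usual definition as a weighted double-centering of the log of a strictly positive table; the paper's exact formulation may differ. The function names and the small example table are hypothetical and do not come from the paper. The sketch compares uniform weights with marginal weights (row and column totals divided by the grand total).

```python
import math


def log_interactions(table, row_w, col_w):
    """Weighted double-centering of log(table); table must be strictly positive.

    Assumed definition (not taken verbatim from the paper):
    lam[i][j] = log n_ij - sum_j col_w[j] log n_ij
                         - sum_i row_w[i] log n_ij
                         + sum_ij row_w[i] col_w[j] log n_ij
    """
    logs = [[math.log(x) for x in row] for row in table]
    n_rows, n_cols = len(logs), len(logs[0])
    # Weighted mean of each row of logs, averaging over columns.
    row_mean = [sum(col_w[j] * logs[i][j] for j in range(n_cols))
                for i in range(n_rows)]
    # Weighted mean of each column of logs, averaging over rows.
    col_mean = [sum(row_w[i] * logs[i][j] for i in range(n_rows))
                for j in range(n_cols)]
    # Weighted grand mean of the logs.
    grand = sum(row_w[i] * row_mean[i] for i in range(n_rows))
    return [[logs[i][j] - row_mean[i] - col_mean[j] + grand
             for j in range(n_cols)] for i in range(n_rows)]


def uniform_weights(n):
    """Equal weight 1/n for each of n rows or columns."""
    return [1.0 / n] * n


def marginal_weights(table):
    """Row and column totals divided by the grand total."""
    total = sum(sum(row) for row in table)
    row_w = [sum(row) / total for row in table]
    col_w = [sum(row[j] for row in table) / total
             for j in range(len(table[0]))]
    return row_w, col_w


# Made-up table of positive counts, for illustration only.
table = [[10.0, 20.0, 70.0],
         [60.0, 60.0, 80.0],
         [5.0, 15.0, 30.0]]

lam_uniform = log_interactions(table, uniform_weights(3), uniform_weights(3))
row_w, col_w = marginal_weights(table)
lam_marginal = log_interactions(table, row_w, col_w)

for name, lam in (("uniform", lam_uniform), ("marginal", lam_marginal)):
    print(name)
    for row in lam:
        print("  " + "  ".join(f"{v: .4f}" for v in row))
```

Under either choice, each row of the result has weighted mean zero with respect to the column weights, and each column has weighted mean zero with respect to the row weights. The two matrices differ cell by cell whenever the log table is not purely additive, which is why the choice of weights matters for the subsequent analysis.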

Taxonomy

Topics: Geochemistry and Geologic Mapping