A Measurement of In-Betweenness and Inference Based on Shape Theories
Dustin Pluta, Xiangmin Xu, Daniel L. Gillen, Zhaoxia Yu

TL;DR
This paper introduces a statistical framework with in-betweenness indices to determine if a subpopulation lies between two others in multivariate space, with applications in biology and neuroscience.
Contribution
It develops novel in-betweenness metrics and inference methods for multivariate subpopulation analysis, addressing a biological question about cell types.
Findings
Successfully applied to Iris data set, demonstrating method validity.
Analyzed breast cancer subtypes to assess risk differences.
Investigated neuronal cell types with electrophysiological features.
Abstract
We propose a statistical framework to investigate whether a given subpopulation lies between two other subpopulations in a multivariate feature space. This methodology is motivated by a biological question from a collaborator: Is a newly discovered cell type between two known types in several given features? We propose two in-betweenness indices (IBI) to quantify the in-betweenness exhibited by a random triangle formed by the summary statistics of the three subpopulations. Statistical inference methods are provided for triangle shape and IBI metrics. The application of our methods is demonstrated in three examples: the classic Iris data set, a study of risk of relapse across three breast cancer subtypes, and the motivating neuronal cell data with measured electrophysiological features.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMorphological variations and asymmetry · Gene expression and cancer classification · Cell Image Analysis Techniques
