Matrix dissimilarities based on differences in moments and sparsity

Li Tuobang

arXiv:2406.02051·q-bio.QM·September 11, 2024

TL;DR

This paper introduces dissimilarity measures based on differences in moments and sparsity, which help identify the factors that drive group differences across biological and social datasets.

Contribution

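The paper presents a dissimilarity approach that separates group differences into differences in moments (such as the mean and standard deviation) and differences in sparsity. Established measures such as Euclidean distance or Bray-Curtis dissimilarity do not show which of these factors drives the difference.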

Findings

01. Sparsity dissimilarity is as effective as mean dissimilarity in predicting COVID-19 drug effects.

02. The method reveals underlying biological factors such as gene regulation and heterogeneity.

03. Extensive dataset reanalysis demonstrates the approach's broad applicability.

Abstract

Generating a dissimilarity matrix is typically the first step in big data analysis. Although numerous methods exist, such as Euclidean distance, Minkowski distance, Manhattan distance, Bray-Curtis dissimilarity, Jaccard similarity, and Dice dissimilarity, it remains unclear which factors drive dissimilarity between groups. In this paper, we introduce an approach based on differences in moments and sparsity. We show that this method can delineate the key factors underlying group differences. For example, in biology, mean dissimilarity indicates differences driven by up- or down-regulated gene expression, standard deviation dissimilarity reflects the heterogeneity of response to treatment, and sparsity dissimilarity corresponds to differences prompted by the activation or silencing of genes. Through extensive reanalysis of genome, transcriptome, proteome, metabolome, immune profiling, microbiome,…
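To make the three quantities named in the abstract concrete, here is a minimal Python sketch. It is not the paper's definition: the abstract does not give formulas, so the function name `moment_sparsity_dissimilarity`, the choice of per-feature absolute differences averaged over features, and the definition of sparsity as the fraction of exact zeros are all assumptions made for illustration. The official repository should be consulted for the author's actual method.

```python
import numpy as np


def moment_sparsity_dissimilarity(group_a, group_b):
    """Illustrative (assumed) dissimilarities between two groups.

    group_a, group_b: 2-D arrays of shape (samples, features), for example
    expression values of the same genes in control and treated samples.
    Each group needs at least two samples for the sample standard deviation.
    """
    a = np.asarray(group_a, dtype=float)
    b = np.asarray(group_b, dtype=float)
    if a.ndim != 2 or b.ndim != 2 or a.shape[1] != b.shape[1]:
        raise ValueError("both groups must be 2-D with the same feature count")

    # Assumption: compare first moments feature by feature.
    mean_diff = np.abs(a.mean(axis=0) - b.mean(axis=0))
    # Assumption: compare spread using the sample standard deviation.
    sd_diff = np.abs(a.std(axis=0, ddof=1) - b.std(axis=0, ddof=1))
    # Assumption: sparsity is the fraction of exact zeros per feature.
    sparsity_diff = np.abs((a == 0).mean(axis=0) - (b == 0).mean(axis=0))

    # Summarize each per-feature difference by its average over features.
    return {
        "mean": float(mean_diff.mean()),
        "sd": float(sd_diff.mean()),
        "sparsity": float(sparsity_diff.mean()),
    }


# Toy data: feature 0 is silent in the control group and active after treatment;
# feature 1 is identical in both groups.
control = [[0, 2], [0, 4], [0, 6]]
treated = [[1, 2], [3, 4], [5, 6]]
result = moment_sparsity_dissimilarity(control, treated)
print(result)
```

In this toy example all three values are nonzero because feature 0 switches from all zeros to varying nonzero values, which changes its mean, its spread, and its sparsity at once. Reporting the three numbers separately is what lets a reader see which factor dominates, in the spirit of the abstract's distinction between regulation, heterogeneity, and activation or silencing.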

Code & Models

Repositories

tubanlee/MD (Official)

Taxonomy

Topics: Image and Signal Denoising Methods