Independence versus Indetermination: basis of two canonical clustering criteria
Pierre Bertrand (LPMA), Michel Broniatowski (LPMA), Jean-Fran\c{c}ois, Marcotorchino

TL;DR
This paper compares two fundamental clustering criteria, based on optimal transport theory, highlighting their properties, differences, and implications for large network clustering.
Contribution
It introduces and analyzes two canonical couplings, 'statistical independence' and 'logical indetermination', providing a comparative framework for clustering methods.
Findings
The two couplings have similar results in network clustering.
Logical indetermination is less known but significant.
Estimated average difference explains their close clustering outcomes.
Abstract
This paper aims at comparing two coupling approaches as basic layers for building clustering criteria, suited for modularizing and clustering very large networks. We briefly use "optimal transport theory" as a starting point, and a way as well, to derive two canonical couplings: "statistical independence" and "logical indetermination". A symmetric list of properties is provided and notably the so called "Monge's properties", applied to contingency matrices, and justifying the versus notation. A study is proposed, highlighting "logical indetermination", because it is, by far, lesser known. Eventually we estimate the average difference between both couplings as the key explanation of their usually close results in network clustering.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Topological and Geometric Data Analysis · Data Management and Algorithms
