GSVD for Geometry-Grounded Dataset Comparison: An Alignment Angle Is All You Need
Eduarda de Souza Marques, Arthur Sobrinho Ferreira da Rocha, Joao Paixao, Heudson Mirandola, Daniel Sadoc Menasche

TL;DR
This paper introduces an angle-based diagnostic score derived from the generalized singular value decomposition (GSVD) to compare datasets in a geometry-grounded manner, providing interpretable insights into data relations.
Contribution
It revisits the classical GSVD primitive, operationalizes it for dataset comparison, and introduces an interpretable angle score for geometric diagnostics.
Findings
The angle score effectively quantifies dataset similarities and differences.
The method provides interpretable geometric diagnostics for data analysis.
Application to MNIST demonstrates practical utility.
Abstract
Geometry-grounded learning asks models to respect structure in the problem domain rather than treating observations as arbitrary vectors. Motivated by this view, we revisit a classical but underused primitive for comparing datasets: linear relations between two data matrices, expressed via the co-span constraint in a shared ambient space. To operationalize this comparison, we use the generalized singular value decomposition (GSVD) as a joint coordinate system for two subspaces. In particular, we exploit the GSVD form , with , which separates shared versus dataset-specific directions through the diagonal structure of . From these factors we derive an interpretable *angle score* for a sample , quantifying whether z is explained relatively more by , more by , or comparably by both. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMorphological variations and asymmetry · Face and Expression Recognition · Tensor decomposition and applications
