TL;DR
This paper introduces a generalized Sobolev IPM framework for graph-based measures using Orlicz geometry, enabling flexible structural priors and efficient computation, with applications in document classification and topological data analysis.
Contribution
It extends Sobolev IPM to Orlicz geometric structures, providing a flexible framework that captures diverse geometric priors and offers computational efficiency through Musielak norm regularization.
Findings
GSI-M is several orders faster than OW in computation.
GSI-M effectively compares probability measures on graphs.
Demonstrates advantages in document classification and topological data analysis.
Abstract
We study the Sobolev IPM problem for measures supported on a graph metric space, where critic function is constrained to lie within the unit ball defined by Sobolev norm. While Le et al. (2025) achieved scalable computation by relating Sobolev norm to weighted -norm, the resulting framework remains intrinsically bound to geometric structure, limiting its ability to incorporate alternative structural priors beyond the geometry paradigm. To overcome this limitation, we propose to generalize Sobolev IPM through the lens of \emph{Orlicz geometric structure}, which employs convex functions to capture nuanced geometric relationships, building upon recent advances in optimal transport theory -- particularly Orlicz-Wasserstein (OW) and generalized Sobolev transport -- that have proven instrumental in advancing machine learning methodologies. This generalization encompasses…
Peer Reviews
Decision·Submitted to ICLR 2026
- The extension of Sobolev IPMs from $L^p$ spaces to Orlicz spaces is reasonable. - The authoors address the computational tractability of this metric. - The paper shows that the proposed GSI-M is a metric and is equivalent to GSI. - The empirical results show that GSI-M is computationally more efficient than the related Orlicz-Wasserstein (OW) distance.
- The novelty of the core technical insight appears largely incremental and derivative. The paper leverages the exact same weight function $\hat{w}(x)$ that was a key finding in Le et al. (2025) to relate the norms. This makes the paper feel like a direct substitution of $L^p$ norms with Orlicz/Musielak norms onto the framework of Le et al. (2025). - The practical motivation for the generalization is weak. The experiments do not demonstrate a compelling advantage for using the more complex Orli
**Theoretical innovation and significance:** The paper makes a significant contribution by generalizing Sobolev IPM through Orlicz geometry, creating meaningful connections between integral probability metrics and transport distances on graphs. The rigorous proofs establishing relationships between GSI-M and GST (Proposition 4.6: $\frac{1}{2} \text{GST}\_{\Phi}(\mu,\nu) \leq \hat{GS}\_{\Phi}(\mu,\nu) \leq \text{GST}_{\Phi}(\mu,\nu)$) provide valuable theoretical insights that advance beyond prio
**Limited experimental validation with key baselines:** While the paper cites Fused Gromov-Wasserstein (FGW) and Fused Partial Gromov-Wasserstein (FPGW) (Bai et al., 2025; Brogat-Motte et al., 2022), it lacks direct comparisons with these methods. Given that FGW has become a standard for structured object matching, including these comparisons would significantly strengthen the empirical validation and better position GSI-M within the broader landscape of graph-based distance metrics. **Narrow e
The main computational advantage of the proposed method lies in the assumption that the compared probability measures are supported on the same graph. Under this assumption, the computation of the optimal transport plan, required in Wasserstein or OW distances, can be avoided, similar to the case where a closed-form solution exists for the Wasserstein distance in the one-dimensional setting. This advantage has also been extensively exploited in several prior works [1, 2, 3].
- At first glance, I thought this paper extends [1], in the sense that the Sobolev IPM in [1] is defined on an L_p geometric structure, while the proposed method generalizes it by replacing the L_p structure with an Orlicz geometric one. However, I found that the method called generalized Sobolev transport (GST, [2]) also employs an Orlicz geometric structure to generalize L_p. Moreover, my understanding is that the proposed method is essentially a weighted version of [2], that is, it introduces
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
