Tree-Preconditioned Differentiable Optimization and Axioms as Layers
Yuexin Liao

TL;DR
This paper presents a novel differentiable framework embedding axiomatic structures of Random Utility Models into neural networks, enabling scalable, provably rational models that generalize well from sparse data.
Contribution
It introduces a Tree-Preconditioned Conjugate Gradient solver and a differentiable projection layer based on axioms, improving scalability and model rationality.
Findings
Achieves superlinear convergence in large-scale problems.
Eliminates structural overfitting compared to penalty-based methods.
Enables joint training of rational models from limited data.
Abstract
This paper introduces a differentiable framework that embeds the axiomatic structure of Random Utility Models (RUM) directly into deep neural networks. Although projecting empirical choice data onto the RUM polytope is NP-hard in general, we uncover an isomorphism between RUM consistency and flow conservation on the Boolean lattice. Leveraging this combinatorial structure, we derive a novel Tree-Preconditioned Conjugate Gradient solver. By exploiting the spanning tree of the constraint graph, our preconditioner effectively "whitens" the ill-conditioned Hessian spectrum induced by the Interior Point Method barrier, achieving superlinear convergence and scaling to problem sizes previously deemed unsolvable. We further formulate the projection as a differentiable layer via the Implicit Function Theorem, where the exact Jacobian propagates geometric constraints during backpropagation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Constraint Satisfaction and Optimization · Complexity and Algorithms in Graphs
