Training Non-Differentiable Networks via Optimal Transport

An T. Le

arXiv:2605.01928·cs.LG·May 5, 2026

Training Non-Differentiable Networks via Optimal Transport

An T. Le

PDF

1 Repo

TL;DR

PolyStep is a novel gradient-free optimization method for training non-differentiable neural networks using optimal transport principles, achieving state-of-the-art results across various architectures.

Contribution

It introduces PolyStep, a gradient-free optimizer based on optimal transport, that effectively trains non-differentiable models surpassing existing methods.

Findings

01

Achieves 93.4% accuracy on hard-LIF spiking networks, outperforming gradient-free baselines.

02

Maintains above 92% clause satisfaction on MAX-SAT with up to 1 million variables.

03

Matches OpenAI-ES performance on RL control tasks under quantization.

Abstract

Neural networks increasingly embed non-differentiable components (spiking neurons, quantized layers, discrete routing, blackbox simulators, etc.) where backpropagation is inapplicable and surrogate gradients introduce bias. We present PolyStep, a gradient-free optimizer that updates parameters using only forward passes. Each step evaluates the loss at structured polytope vertices in a compressed subspace, computes softmax-weighted assignments over the resulting cost matrix, and displaces particles toward low-cost vertices via barycentric projection. This update corresponds to the one-sided limit of a regularized optimal-transport problem, inheriting its geometric structure without Sinkhorn iterations. PolyStep trains genuinely non-differentiable models where existing gradient-free methods collapse to near-random accuracy. On hard-LIF spiking networks we reach 93.4% test accuracy,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

anindex/polystep
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.