Convex Efficient Coding
William Dorrell, Peter E. Latham, James Whittington

TL;DR
This paper introduces a family of convex normative models for neural coding based on representational similarity, enabling tractable analysis of complex neural encoding strategies and deriving new insights into neural representations.
Contribution
It constructs a set of convex, flexible normative theories of neural coding based on representational similarity, including new results on identifiability and neural tuning analysis.
Findings
First necessary and sufficient identifiability result for semi-nonnegative matrix factorisation.
Neural tunings are uniquely linked to optimal representational similarity under certain conditions.
Dense retinal codes, but not sparse cortical codes, optimally split encoding into ON & OFF channels.
Abstract
Why do neurons encode information the way they do? Normative answers to this question model neural activity as the solution to an optimisation problem; for example, the celebrated efficient coding hypothesis frames neural activity as the optimal encoding of information under efficiency constraints. Successful normative theories have varied dramatically in complexity, from simple linear models (Atick & Redlich '90), to complex deep neural networks (Lindsay '21). What complex models gain in flexibility, they lose in tractability and often understandability. Here, we split the difference by constructing a set of tractable but flexible normative representational theories. Instead of optimising the neural activities directly, following Sengupta et al. '18, we optimise the representational similarity, a matrix formed from the dot products of each pair of neural responses. Using this, we show…
Peer Reviews
Decision·ICLR 2026 Poster
- The heart of the paper, that optimizations problems that may not be convex in a representation $\bf{Z}$ can be convex in the RSM $\bf{Q} = \bf{Z}^T \bf{Z}$, is both quite interesting, well placed in the literature of neural circuits (via the anchor of similarity matching objectives), and to my knowledge novel. - This key observation is used to substantial effect: it allows for the generalization of previous results in matrix factorization, and can be used to derive interpretable insights into
- Theorem 1 generalizes a previous results on problem 1 to the case of arbitrary (linear) source mixing (from orthogonal linear mixing). It is not obvious to me how significant this generalization is in practice. I.e. the value of this contribution could be made more clear if the author's outlined some real world cases where this contribution would be necessary to deliver theoretical predictions. - Regarding the limitations of $N_{neurons} > N_{samples}$ and the assumption of perfect fitting: i
* Originality: Unifies a large class of efficient-coding models under a convex RS-matrix formulation. The "tight-scattering" condition provides an elegant, necessary-and-sufficient identifiability test not seen in prior work. * Quality: Derivations are correct and geometrically motivated. Proofs connect convex geometry to neural interpretability. * Clarity: Writing is clean, with an explicit "menu" of convex constraints and objectives. * Significance: Offers theoretical justification for single-
* The proposed semi-nonnegative factorization closely mirrors nonnegative sparse coding and semi-NMF [1]. In both cases, only the code (not the dictionary) is nonnegative, which already produces parts-based modularity. The paper should explicitly benchmark against these baselines under matched sparsity or energy constraints, clarifying whether convex RS-matrix optimization yields new behavior or simply a reformulation. * The paper equates "representational similarity" with geometry preservation,
****The paper carefully studies a set of constrained optimization problems that are akin to semi-negative matrix factorization. Some analytical results are derived and presented. ****Some discussions on the connections to neuroscience were presented.
****The work is quite incremental. The results, in particular the applications of the modeling framework, is rather preliminary. ****Despite of the claim that the model framework analyzed is widely used in neuroscience, it is actually quite limited. Maybe the framework and the results are indeed potentially useful to understanding some phenomena in neuroscience. If that is the case, it would be helpful to show these applications directly in the paper. Section 4 mentioned some potential appli
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace Recognition and Perception · Neural dynamics and brain function · Retinal Development and Disorders
