On convex decision regions in deep network representations

Lenka T\v{e}tkov\'a; Thea Br\"usch; Teresa Karen Scheidt; Fabian Martin Mager; Rasmus {\O}rtoft Aagaard; Jonathan Foldager; Tommy Sonne Alstr{\o}m; Lars Kai Hansen

arXiv:2305.17154·cs.LG·July 3, 2025·1 cites

On convex decision regions in deep network representations

Lenka T\v{e}tkov\'a, Thea Br\"usch, Teresa Karen Scheidt, Fabian Martin Mager, Rasmus {\O}rtoft Aagaard, Jonathan Foldager, Tommy Sonne Alstr{\o}m, Lars Kai Hansen

PDF

Open Access 1 Repo 3 Reviews

TL;DR

This paper investigates the convexity of concept regions in deep neural network representations, proposing tools to measure it and showing its prevalence and importance for generalization and fine-tuning performance.

Contribution

It introduces methods to quantify convexity in neural representations and demonstrates its robustness and significance across various domains and training stages.

Findings

01

Convexity is widespread in neural representations across domains.

02

Fine-tuning generally increases convexity of label regions.

03

Pretraining convexity predicts fine-tuning success.

Abstract

Current work on human-machine alignment aims at understanding machine-learned latent spaces and their correspondence to human representations. G{\"a}rdenfors' conceptual spaces is a prominent framework for understanding human representations. Convexity of object regions in conceptual spaces is argued to promote generalizability, few-shot learning, and interpersonal alignment. Based on these insights, we investigate the notion of convexity of concept regions in machine-learned latent spaces. We develop a set of tools for measuring convexity in sampled data and evaluate emergent convexity in layered representations of state-of-the-art deep networks. We show that convexity is robust to basic re-parametrization and, hence, meaningful as a quality of machine-learned latent spaces. We find that approximate convexity is pervasive in neural representations in multiple application domains,…

Peer Reviews

Decision·Submitted to ICLR 2024

Reviewer 01Rating 5· marginally below the acceptance thresholdConfidence 3

Strengths

In recent years, the large language model has been demonstrating a huge potential to realize AI whilst humans don't clearly understand how it works, which makes the papers that aim to interpret deep neural networks more important. I like the idea of the paper to demystify the learned representation in terms of convexity. Also the paper derived a set of tools from convex theory and made it possible to analyze the convex property learned inside neural network. Also the paper analyzes neural n

Weaknesses

I have concerns about the paper. Before I lay out the weaknesses list, I would like to mention that I’m not an expert of this domain. And so my suggestions and comments are possibly incorrect. Is the conclusion of the paper about neural networks new? Scientists who have been developing artificial neural networks are trying to follow the conclusions like convexity found by neural scientists. I think we, as developers of artificial neural networks, would naturally consider this property by design

Reviewer 02Rating 5· marginally below the acceptance thresholdConfidence 3

Strengths

The investigation of model convexity is intriguing.

Weaknesses

1. The organization of the paper's content is unreasonable. For instance, the introduction section dedicates substantial space to discussing the relationship between convexity and generalization, while neglecting to mention the specific objectives of the paper. This section should be reorganized by moving background information to the related work or appendix while focusing on introducing the main content and contribution of the paper. 2. The experimental setup in the paper has significant issu

Reviewer 03Rating 6· marginally above the acceptance thresholdConfidence 4

Strengths

+ The paper is clear and easy to understand. + The idea of linking human thinking and computer learning is interesting and could be important for understanding the mind and contributing to cognitive science. + The experimental analysis in the paper is comprehensive and varied.

Weaknesses

- The concept of measuring convexity in NNs appears to be somewhat circular. It doesn’t seem like the convexity in NNs spontaneously emerges from the data or is an unexpected outcome; rather, it seems intentionally designed. Take the "graph convexity" prominently featured in the paper, for example. If we view the final softmax layer as the classifier (setting the decision boundary) and the feature vector inputted into this final layer as a data point on a manifold, it's almost always a given for

Code & Models

Repositories

lenkatetkova/convexity-of-representations
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · AI in cancer detection