On convex decision regions in deep network representations
Lenka T\v{e}tkov\'a, Thea Br\"usch, Teresa Karen Scheidt, Fabian Martin Mager, Rasmus {\O}rtoft Aagaard, Jonathan Foldager, Tommy Sonne Alstr{\o}m, Lars Kai Hansen

TL;DR
This paper investigates the convexity of concept regions in deep neural network representations, proposing tools to measure it and showing its prevalence and importance for generalization and fine-tuning performance.
Contribution
It introduces methods to quantify convexity in neural representations and demonstrates its robustness and significance across various domains and training stages.
Findings
Convexity is widespread in neural representations across domains.
Fine-tuning generally increases convexity of label regions.
Pretraining convexity predicts fine-tuning success.
Abstract
Current work on human-machine alignment aims at understanding machine-learned latent spaces and their correspondence to human representations. G{\"a}rdenfors' conceptual spaces is a prominent framework for understanding human representations. Convexity of object regions in conceptual spaces is argued to promote generalizability, few-shot learning, and interpersonal alignment. Based on these insights, we investigate the notion of convexity of concept regions in machine-learned latent spaces. We develop a set of tools for measuring convexity in sampled data and evaluate emergent convexity in layered representations of state-of-the-art deep networks. We show that convexity is robust to basic re-parametrization and, hence, meaningful as a quality of machine-learned latent spaces. We find that approximate convexity is pervasive in neural representations in multiple application domains,…
Peer Reviews
Decision·Submitted to ICLR 2024
In recent years, the large language model has been demonstrating a huge potential to realize AI whilst humans don't clearly understand how it works, which makes the papers that aim to interpret deep neural networks more important. I like the idea of the paper to demystify the learned representation in terms of convexity. Also the paper derived a set of tools from convex theory and made it possible to analyze the convex property learned inside neural network. Also the paper analyzes neural n
I have concerns about the paper. Before I lay out the weaknesses list, I would like to mention that I’m not an expert of this domain. And so my suggestions and comments are possibly incorrect. Is the conclusion of the paper about neural networks new? Scientists who have been developing artificial neural networks are trying to follow the conclusions like convexity found by neural scientists. I think we, as developers of artificial neural networks, would naturally consider this property by design
The investigation of model convexity is intriguing.
1. The organization of the paper's content is unreasonable. For instance, the introduction section dedicates substantial space to discussing the relationship between convexity and generalization, while neglecting to mention the specific objectives of the paper. This section should be reorganized by moving background information to the related work or appendix while focusing on introducing the main content and contribution of the paper. 2. The experimental setup in the paper has significant issu
+ The paper is clear and easy to understand. + The idea of linking human thinking and computer learning is interesting and could be important for understanding the mind and contributing to cognitive science. + The experimental analysis in the paper is comprehensive and varied.
- The concept of measuring convexity in NNs appears to be somewhat circular. It doesn’t seem like the convexity in NNs spontaneously emerges from the data or is an unexpected outcome; rather, it seems intentionally designed. Take the "graph convexity" prominently featured in the paper, for example. If we view the final softmax layer as the classifier (setting the decision boundary) and the feature vector inputted into this final layer as a data point on a manifold, it's almost always a given for
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · AI in cancer detection
