Investigating Neural Architectures by Synthetic Dataset Design
Adrien Courtois, Jean-Michel Morel, Pablo Arias

TL;DR
This paper proposes a systematic methodology using synthetic datasets to evaluate neural network architectures' abilities, revealing strengths and limitations in tasks like long-range dependencies, translation covariance, and pixel grouping.
Contribution
It introduces a novel synthetic dataset-based approach to measure specific neural network capabilities, providing insights beyond traditional benchmarks.
Findings
Identified a nonlocal deficit in U-Net architecture.
Showed that nonlocal layers improve depth estimation.
Demonstrated the importance of self-attention mechanisms.
Abstract
Recent years have seen the emergence of many new neural network structures (architectures and layers). To solve a given task, a network requires a certain set of abilities reflected in its structure. The required abilities depend on each task. There is so far no systematic study of the real capacities of the proposed neural structures. The question of what each structure can and cannot achieve is only partially answered by its performance on common benchmarks. Indeed, natural data contain complex unknown statistical cues. It is therefore impossible to know what cues a given neural structure is taking advantage of in such data. In this work, we sketch a methodology to measure the effect of each structure on a network's ability, by designing ad hoc synthetic datasets. Each dataset is tailored to assess a given ability and is reduced to its simplest form: each input contains exactly the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Advanced Vision and Imaging · Industrial Vision Systems and Defect Detection
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Concatenated Skip Connection · Convolution · U-Net · High-Order Consensuses
