Distinguishing artefacts: evaluating the saturation point of convolutional neural networks
Ric Real, James Gopsill, David Jones, Chris Snider, Ben Hicks

TL;DR
This study evaluates how well convolutional neural networks can classify CAD models from large repositories using synthetic images, revealing performance limits at around 200 models and potential for search applications.
Contribution
It introduces a method for generating large synthetic datasets from online CAD repositories and assesses CNN classification performance as repository size increases.
Findings
CNN classification performance deteriorates beyond 200 models.
Top-5 match rate indicates potential for search applications.
Synthetic data generation enables large-scale model classification experiments.
Abstract
Prior work has shown Convolutional Neural Networks (CNNs) trained on surrogate Computer Aided Design (CAD) models are able to detect and classify real-world artefacts from photographs. The applications of which support twinning of digital and physical assets in design, including rapid extraction of part geometry from model repositories, information search \& retrieval and identifying components in the field for maintenance, repair, and recording. The performance of CNNs in classification tasks have been shown dependent on training data set size and number of classes. Where prior works have used relatively small surrogate model data sets ( models), the question remains as to the ability of a CNN to differentiate between models in increasingly large model repositories. This paper presents a method for generating synthetic image data sets from online CAD model repositories, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
