"You might also like this model": Data Driven Approach for Recommending Deep Learning Models for Unknown Image Datasets
Ameya Prabhu, Riddhiman Dasgupta, Anush Sankaran, Srikanth, Tamilselvam, Senthil Mani

TL;DR
This paper introduces a data-driven method to recommend suitable deep learning architectures for unknown image datasets and predict their performance without training, streamlining model selection.
Contribution
It presents a novel architecture recommendation system using a model encoder and performance prediction for unseen datasets, enhancing efficiency in deep learning model selection.
Findings
Predicted accuracy closely matches actual performance.
Effective for image datasets, with challenges noted for text data.
Repository and implementation are publicly available.
Abstract
For an unknown (new) classification dataset, choosing an appropriate deep learning architecture is often a recursive, time-taking, and laborious process. In this research, we propose a novel technique to recommend a suitable architecture from a repository of known models. Further, we predict the performance accuracy of the recommended architecture on the given unknown dataset, without the need for training the model. We propose a model encoder approach to learn a fixed length representation of deep learning architectures along with its hyperparameters, in an unsupervised fashion. We manually curate a repository of image datasets with corresponding known deep learning models and show that the predicted accuracy is a good estimator of the actual accuracy. We discuss the implications of the proposed approach for three benchmark images datasets and also the challenges in using the approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Advanced Neural Network Applications
