CGI: Identifying Conditional Generative Models with Example Images
Zhi Zhou, Hao-Zhe Tan, Peng-Xiao Song, Lan-Zhe Guo

TL;DR
This paper introduces CGI, a method for identifying suitable generative models using user-provided images, improving search efficiency in large model hubs, and demonstrating high accuracy and better FID scores.
Contribution
It proposes the PMI approach for precise model description and matching, along with a benchmark of 65 models and 9100 tasks to evaluate effectiveness.
Findings
92% model identification accuracy with four example images
Significantly improved FID scores
Effective human evaluation results
Abstract
Generative models have achieved remarkable performance recently, and thus model hubs have emerged. Existing model hubs typically assume basic text matching is sufficient to search for models. However, in reality, due to different abstractions and the large number of models in model hubs, it is not easy for users to review model descriptions and example images, choosing which model best meets their needs. Therefore, it is necessary to describe model functionality wisely so that future users can efficiently search for the most suitable model for their needs. Efforts to address this issue remain limited. In this paper, we propose Conditional Generative Model Identification (CGI), which aims to provide an effective way to identify the most suitable model using user-provided example images rather than requiring users to manually review a large number of models with example images. To address…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Modeling in Geospatial Applications · Semantic Web and Ontologies
