Adaptive Routing of Text-to-Image Generation Requests Between Large Cloud Model and Light-Weight Edge Model
Zewei Xin, Qinya Li, Chaoyue Niu, Fan Wu, Guihai Chen

TL;DR
This paper introduces RouteT2I, a dynamic routing framework that intelligently chooses between large cloud and lightweight edge models for text-to-image generation, balancing quality and cost effectively.
Contribution
It proposes a novel multi-metric quality evaluation method and a routing strategy that optimizes model selection based on prompt complexity and quality-cost trade-offs.
Findings
Reduces cloud model requests significantly
Maintains high image quality with fewer cloud requests
Balances performance and cost effectively
Abstract
Large text-to-image models demonstrate impressive generation capabilities; however, their substantial size necessitates expensive cloud servers for deployment. Conversely, light-weight models can be deployed on edge devices at lower cost but often with inferior generation quality for complex user prompts. To strike a balance between performance and cost, we propose a routing framework, called RouteT2I, which dynamically selects either the large cloud model or the light-weight edge model for each user prompt. Since generated image quality is challenging to measure and compare directly, RouteT2I establishes multi-dimensional quality metrics, particularly, by evaluating the similarity between the generated images and both positive and negative texts that describe each specific quality metric. RouteT2I then predicts the expected quality of the generated images by identifying key tokens in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Advanced Data and IoT Technologies · Recommender Systems and Techniques
