Adaptive Routing of Text-to-Image Generation Requests Between Large Cloud Model and Light-Weight Edge Model

Zewei Xin; Qinya Li; Chaoyue Niu; Fan Wu; Guihai Chen

arXiv:2411.13787·cs.CV·August 22, 2025

Adaptive Routing of Text-to-Image Generation Requests Between Large Cloud Model and Light-Weight Edge Model

Zewei Xin, Qinya Li, Chaoyue Niu, Fan Wu, Guihai Chen

PDF

Open Access

TL;DR

This paper introduces RouteT2I, a dynamic routing framework that intelligently chooses between large cloud and lightweight edge models for text-to-image generation, balancing quality and cost effectively.

Contribution

It proposes a novel multi-metric quality evaluation method and a routing strategy that optimizes model selection based on prompt complexity and quality-cost trade-offs.

Findings

01

Reduces cloud model requests significantly

02

Maintains high image quality with fewer cloud requests

03

Balances performance and cost effectively

Abstract

Large text-to-image models demonstrate impressive generation capabilities; however, their substantial size necessitates expensive cloud servers for deployment. Conversely, light-weight models can be deployed on edge devices at lower cost but often with inferior generation quality for complex user prompts. To strike a balance between performance and cost, we propose a routing framework, called RouteT2I, which dynamically selects either the large cloud model or the light-weight edge model for each user prompt. Since generated image quality is challenging to measure and compare directly, RouteT2I establishes multi-dimensional quality metrics, particularly, by evaluating the similarity between the generated images and both positive and negative texts that describe each specific quality metric. RouteT2I then predicts the expected quality of the generated images by identifying key tokens in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGraph Theory and Algorithms · Advanced Data and IoT Technologies · Recommender Systems and Techniques