Text-to-Model: Text-Conditioned Neural Network Diffusion for Train-Once-for-All Personalization
Zexi Li, Lingzhi Gao, Chao Wu

TL;DR
This paper introduces Tina, a neural network diffusion model conditioned on text, capable of generating personalized models for diverse tasks from minimal data, demonstrating strong generalization and understanding of world knowledge.
Contribution
We propose Tina, a novel text-conditioned neural network diffusion approach enabling train-once-for-all personalization across numerous tasks with minimal data.
Findings
Tina generalizes well to in-distribution and out-of-distribution tasks.
Tina effectively handles zero-shot and few-shot scenarios.
Tina demonstrates understanding of world knowledge through natural language prompts.
Abstract
Generative artificial intelligence (GenAI) has made significant progress in understanding world knowledge and generating content from human languages across various modalities, like text-to-text large language models, text-to-image stable diffusion, and text-to-video Sora. While in this paper, we investigate the capability of GenAI for text-to-model generation, to see whether GenAI can comprehend hyper-level knowledge embedded within AI itself parameters. Specifically, we study a practical scenario termed train-once-for-all personalization, aiming to generate personalized models for diverse end-users and tasks using text prompts. Inspired by the recent emergence of neural network diffusion, we present Tina, a text-conditioned neural network diffusion for train-once-for-all personalization. Tina leverages a diffusion transformer model conditioned on task descriptions embedded using a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel-Driven Software Engineering Techniques · Business Process Modeling and Analysis
MethodsContrastive Language-Image Pre-training · Diffusion
