Generalizing Large Language Model Usability Across Resource-Constrained
Yun-Da Tsai

TL;DR
This paper systematically explores methods to improve the adaptability, robustness, and efficiency of large language models in resource-constrained and multimodal environments without extensive retraining.
Contribution
It introduces a unified framework combining modality integration, adversarial prompting, inference-time optimization, and low-resource domain techniques to enhance LLM usability in real-world settings.
Findings
Supports seamless multimodal integration via natural language interfaces.
Improves robustness with adversarial prompt-based stress testing.
Achieves state-of-the-art results in low-resource code generation.
Abstract
Large Language Models (LLMs) have achieved remarkable success across a wide range of natural language tasks, and recent efforts have sought to extend their capabilities to multimodal domains and resource-constrained environments. However, existing approaches often rely on costly supervised fine-tuning or assume fixed training conditions, limiting their generalization when facing unseen modalities, limited data, or restricted compute resources. This dissertation presents a systematic study toward generalizing LLM usability under real-world constraints. First, it introduces a robust text-centric alignment framework that enables LLMs to seamlessly integrate diverse modalities-including text, images, tables, and any modalities - via natural language interfaces. This approach supports in-context adaptation to unseen or dynamically changing modalities without requiring retraining. To enhance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
