Online Domain-aware LLM Decoding for Continual Domain Evolution
Mohammad Abu-Shaira, Weishi Shi

TL;DR
This paper introduces ODD, a real-time decoding framework that adaptively fuses a base LLM with a prefix-tree prior to handle continuous domain evolution and concept drift without retraining.
Contribution
The paper proposes a novel online decoding method that dynamically adjusts to domain changes, improving LLM performance in evolving environments without costly retraining.
Findings
ODD outperforms baseline methods across all metrics.
Achieves 0.065 ROUGE-L gain and 13.6% cosine similarity improvement.
Demonstrates robustness to lexical and contextual shifts.
Abstract
LLMs are typically fine-tuned offline on domain-specific data, assuming a static domain. In practice, domain knowledge evolves continuously through new regulations, products, services, and interaction patterns. Retraining or fine-tuning LLMs for every new instance is computationally infeasible. Additionally, real-world environments also exhibit temporal dynamics with shifting data distributions. Disregarding this phenomenon, commonly referred to as concept drift, can significantly diminish a model's predictive accuracy. This mismatch between evolving domains and static adaptation pipelines highlights the need for efficient, real-time adaptation without costly retraining. In response, we introduce Online Domain-aware Decoding framework (ODD). ODD performs probability-level fusion between a base LLM and a prefix-tree prior, guided by adaptive confidence modulation using disagreement and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Topic Modeling · Domain Adaptation and Few-Shot Learning
