Mortgage Language Model: Domain-Adaptive Pretraining with Residual Instruction, Alignment Tuning, and Task-Specific Routing

Manish Jain; Satheesh Kumar Ponnambalam; Salman Faroz; Chandrakanth Lns; Vinay Sharma

arXiv:2511.21101·cs.CL·December 10, 2025

Mortgage Language Model: Domain-Adaptive Pretraining with Residual Instruction, Alignment Tuning, and Task-Specific Routing

Manish Jain, Satheesh Kumar Ponnambalam, Salman Faroz, Chandrakanth Lns, Vinay Sharma

PDF

Open Access

TL;DR

MortgageLLM is a domain-specific large language model for mortgage finance that uses a dual-expert architecture and residual instruction techniques to enhance specialized task performance while maintaining instruction-following capabilities.

Contribution

This paper introduces a dual-expert architecture and residual instruction technique for domain adaptation in large language models, specifically applied to mortgage finance.

Findings

01

Significantly outperforms base LLaMA-3.1-8B on domain benchmarks.

02

Achieves higher scores in summarization, Q&A, and classification tasks.

03

Demonstrates effective domain adaptation without sacrificing instruction-following fidelity.

Abstract

Large Language Models (LLMs) demonstrate exceptional capabilities across general domains, yet their application to specialized sectors such as mortgage finance requires domain-specific knowledge augmentation while preserving instruction-following fidelity. We present MortgageLLM, a novel domain-specific large language model that addresses this dual challenge. It is developed using a dual-track specialization framework from a single base model (LLaMA-3.1-8B). We opted for this dual-expert approach as a single multi-task model suffers from performance trade-offs, where optimizing for structured tasks (via SFT) degrades conversational fidelity (via DPO). Our dual-track method solves this by creating two specialists, allowing each to be optimally trained for its distinct capability. Our approach applies the instruction residual technique to restore instruction-following capabilities…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Artificial Intelligence in Healthcare and Education