NILE: Internal Consistency Alignment in Large Language Models

Minda Hu; Qiyuan Zhang; Yufei Wang; Bowei He; Hongru Wang; Jingyan Zhou; Liangyou Li; Yasheng Wang; Chen Ma; Irwin King

arXiv:2412.16686·cs.CL·September 23, 2025

NILE: Internal Consistency Alignment in Large Language Models

Minda Hu, Qiyuan Zhang, Yufei Wang, Bowei He, Hongru Wang, Jingyan Zhou, Liangyou Li, Yasheng Wang, Chen Ma, Irwin King

PDF

Open Access 1 Datasets 1 Video

TL;DR

The paper introduces NILE, a framework that improves instruction fine-tuning datasets by aligning them with LLMs' internal knowledge, significantly boosting model performance across various evaluation datasets.

Contribution

NILE is a novel framework that elicits and leverages internal knowledge of LLMs to revise and filter datasets, enhancing the effectiveness of instruction fine-tuning.

Findings

01

NILE-aligned datasets achieve up to 66.6% performance gain.

02

Internal consistency filtering improves dataset quality and model performance.

03

Each component of NILE contributes to performance improvements.

Abstract

As a crucial step to enhance LLMs alignment with human intentions, Instruction Fine-Tuning (IFT) has a high demand on dataset quality. However, existing IFT datasets often contain knowledge that is inconsistent with LLMs' internal knowledge learned from the pre-training phase, which can greatly affect the efficacy of IFT. To address this issue, we introduce NILE (iNternal consIstency aLignmEnt) framework, aimed at optimizing IFT datasets to unlock LLMs' capability further. NILE operates by eliciting target pre-trained LLM's internal knowledge corresponding to instruction data. The internal knowledge is leveraged to revise the answer in IFT datasets. Additionally, we propose a novel Internal Consistency Filtering (ICF) method to filter training samples, ensuring its high consistency with LLM's internal knowledge. Our experiments demonstrate that NILE-aligned IFT datasets sharply boost…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

mindahu/NILE-IFT-Dataset
dataset· 11 dl
11 dl

Videos

NILE: Internal Consistency Alignment in Large Language Models· underline

Taxonomy

TopicsTopic Modeling