IRepair: An Intent-Aware Approach to Repair Data-Driven Errors in Large Language Models

Sayem Mohammad Imtiaz; Astha Singh; Fraol Batole; Hridesh Rajan

arXiv:2502.07072·cs.CL·March 12, 2025

TL;DR

This paper introduces IRepair, a novel intent-aware, dynamic slicing method for repairing large language models by focusing on the most error-prone layers, improving repair effectiveness while preserving overall performance.

Contribution

The paper proposes a dynamic, intent-aware slicing technique for targeted model repair, reducing damage to general performance and focusing on error-prone sections of LLMs.

Findings

01. IRepair repairs errors 43.6% more effectively than baselines.
02. It causes 46% less disruption to general performance.
03. Errors are concentrated in the top 20% of model layers.

Abstract

Not a day goes by without hearing about the impressive feats of large language models (LLMs), and equally, not a day passes without hearing about their challenges. LLMs are notoriously vulnerable to biases in their dataset, leading to issues such as toxicity. While domain-adaptive training has been employed to mitigate these issues, these techniques often address all model parameters indiscriminately during the repair process, resulting in poor repair quality and reduced model versatility. In this paper, we introduce a novel dynamic slicing-based intent-aware LLM repair strategy, IRepair. This approach selectively targets the most error-prone sections of the model for repair. Specifically, we propose dynamically slicing the model's most sensitive layers that require immediate attention, concentrating repair efforts on those areas. This method enables more effective repairs with…
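The core idea in the abstract, picking the most sensitive layers and concentrating the repair on them, can be illustrated with a small sketch. This is not the authors' implementation. It assumes, purely for illustration, that a layer's sensitivity is the L2 norm of its gradient on the repair data, that the slice is a contiguous window of layers, and that repair is a plain gradient step. The names `layer_sensitivity`, `select_slice`, and `repair_step` are hypothetical.

```python
import math
from typing import List, Tuple

Layer = List[float]  # one layer's parameters, flattened (toy representation)


def layer_sensitivity(grads: List[Layer]) -> List[float]:
    """Assumed sensitivity score: L2 norm of each layer's gradient."""
    return [math.sqrt(sum(g * g for g in layer)) for layer in grads]


def select_slice(sensitivity: List[float], width: int) -> Tuple[int, int]:
    """Return (start, end) of the contiguous window of `width` layers
    with the largest total sensitivity; `end` is exclusive."""
    if not 0 < width <= len(sensitivity):
        raise ValueError("width must be between 1 and the number of layers")
    window = sum(sensitivity[:width])
    best_sum, best_start = window, 0
    for start in range(1, len(sensitivity) - width + 1):
        # Slide the window one layer to the right.
        window += sensitivity[start + width - 1] - sensitivity[start - 1]
        if window > best_sum:
            best_sum, best_start = window, start
    return best_start, best_start + width


def repair_step(weights: List[Layer], grads: List[Layer],
                width: int, lr: float) -> List[Layer]:
    """One repair step: re-select the slice from the current gradients
    (the 'dynamic' part) and update only the layers inside it."""
    start, end = select_slice(layer_sensitivity(grads), width)
    return [
        [w - lr * g for w, g in zip(layer_w, layer_g)]
        if start <= i < end else list(layer_w)
        for i, (layer_w, layer_g) in enumerate(zip(weights, grads))
    ]


# Toy example: five layers of two parameters each.
weights = [[1.0, 1.0] for _ in range(5)]
grads = [[0.1, 0.0], [0.0, 0.2], [3.0, 4.0], [0.6, 0.8], [0.0, 0.1]]
repaired = repair_step(weights, grads, width=2, lr=0.1)
```

In this toy run only layers 2 and 3 change, because their gradients dominate; the other layers are left untouched, which is the mechanism the abstract credits with limiting damage to general performance. Because the slice is recomputed at every step, the selected layers can shift as the repair progresses.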

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Data Quality and Management

Methods: GPT-Neo