Gabliteration: Adaptive Multi-Directional Neural Weight Modification for Selective Behavioral Alteration in Large Language Models

G\"okdeniz G\"ulmez

arXiv:2512.18901·cs.AI·January 29, 2026

Gabliteration: Adaptive Multi-Directional Neural Weight Modification for Selective Behavioral Alteration in Large Language Models

G\"okdeniz G\"ulmez

PDF

Open Access 10 Models

TL;DR

Gabliteration introduces an adaptive, multi-directional neural weight modification technique that effectively alters specific behaviors in large language models while preserving overall model quality.

Contribution

It proposes a novel method with dynamic layer optimization and regularized projections to improve behavioral modification in large language models.

Findings

01

Effective behavioral changes with minimal quality loss

02

Validated across models from 0.6B to 4B parameters

03

Available on Hugging Face for practical use

Abstract

We present Gabliteration, a novel neural weight modification technique that advances beyond traditional abliteration methods by implementing adaptive multi-directional projections with regularized layer selection. Our approach addresses the fundamental limitation of existing methods that compromise model quality while attempting to modify specific behavioral patterns. Through dynamic layer optimization, regularized projection matrices, and adaptive scaling mechanisms, we achieve theoretically superior weight modification while minimizing quality degradation in unrelated domains. We validate our method through the gabliterated-v1 model series (0.6B to 4B parameters) available on Hugging Face, demonstrating practical applicability across multiple model scales.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning