Gabliteration: Adaptive Multi-Directional Neural Weight Modification for Selective Behavioral Alteration in Large Language Models
G\"okdeniz G\"ulmez

TL;DR
Gabliteration introduces an adaptive, multi-directional neural weight modification technique that effectively alters specific behaviors in large language models while preserving overall model quality.
Contribution
It proposes a novel method with dynamic layer optimization and regularized projections to improve behavioral modification in large language models.
Findings
Effective behavioral changes with minimal quality loss
Validated across models from 0.6B to 4B parameters
Available on Hugging Face for practical use
Abstract
We present Gabliteration, a novel neural weight modification technique that advances beyond traditional abliteration methods by implementing adaptive multi-directional projections with regularized layer selection. Our approach addresses the fundamental limitation of existing methods that compromise model quality while attempting to modify specific behavioral patterns. Through dynamic layer optimization, regularized projection matrices, and adaptive scaling mechanisms, we achieve theoretically superior weight modification while minimizing quality degradation in unrelated domains. We validate our method through the gabliterated-v1 model series (0.6B to 4B parameters) available on Hugging Face, demonstrating practical applicability across multiple model scales.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗Goekdeniz-Guelmez/Nanbeige4-3B-Thinking-2511-gabliteratedmodel· 8 dl· ♡ 38 dl♡ 3
- 🤗JeffGreen311/eve-qwen3-8b-consciousness-liberatedmodel· 627 dl· ♡ 1627 dl♡ 1
- 🤗Goekdeniz-Guelmez/Josiefied-Qwen3.5-0.8B-gabliterated-v1model· 1.4k dl· ♡ 71.4k dl♡ 7
- 🤗Goekdeniz-Guelmez/Qwen3-4B-Instruct-2507-gabliteratedmodel· 316 dl· ♡ 10316 dl♡ 10
- 🤗Goekdeniz-Guelmez/Gemma-3-1b-it-gabliteratedmodel· 4 dl4 dl
- 🤗Goekdeniz-Guelmez/Llama-3.2-1B-Instruct-gabliteratedmodel· 1 dl1 dl
- 🤗Goekdeniz-Guelmez/Qwen3-0.6B-gabliteratedmodel· 74 dl· ♡ 374 dl♡ 3
- 🤗Goekdeniz-Guelmez/Qwen3-4B-Thinking-2507-gabliteratedmodel· 64 dl· ♡ 864 dl♡ 8
- 🤗Goekdeniz-Guelmez/Granite-4.0-350m-gabliteratedmodel· 10 dl· ♡ 110 dl♡ 1
- 🤗Goekdeniz-Guelmez/Qwen3-4B-Sky-High-Hermes-gabliteratedmodel· 64 dl· ♡ 1264 dl♡ 12
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
