AlignTune: Modular Toolkit for Post-Training Alignment of Large Language Models

R E Zera Marveen Lyngkhoi; Chirag Chawla; Pratinav Seth; Utsav Avaiya; Soham Bhattacharjee; Mykola Khandoga; Rui Yuan; Vinay Kumar Sankarapu

arXiv:2602.09621 · cs.CL · February 12, 2026

Open Access

TL;DR

AlignTune is a modular toolkit that streamlines post-training alignment of large language models by standardizing workflows, enabling reproducible experiments, and supporting flexible optimization methods.

Contribution

The paper introduces a unified, extensible toolkit for LLM alignment that addresses the backend interference and reproducibility problems of current practice.

Findings

01. Standardizes alignment workflows across different backends
02. Enables controlled, reproducible experiments in LLM alignment
03. Supports both supervised fine-tuning and RLHF-style optimization

Abstract

Post-training alignment is central to deploying large language models (LLMs), yet practical workflows remain split across backend-specific tools and ad-hoc glue code, making experiments hard to reproduce. We identify backend interference, reward fragmentation, and irreproducible pipelines as key obstacles in alignment research. We introduce AlignTune, a modular toolkit exposing a unified interface for supervised fine-tuning (SFT) and RLHF-style optimization with interchangeable TRL and Unsloth backends. AlignTune standardizes configuration, provides an extensible reward layer (rule-based and learned), and integrates evaluation over standard benchmarks and custom tasks. By isolating backend-specific logic behind a single factory boundary, AlignTune enables controlled comparisons and reproducible alignment experiments.
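
The "single factory boundary" and the extensible reward layer described in the abstract can be pictured with a short sketch. This is not AlignTune's actual API: every name below (`TrainConfig`, `create_trainer`, `TRLTrainerStub`, `UnslothTrainerStub`, `combine_rewards`, and the two example reward functions) is hypothetical, and the backend classes are stubs that never import TRL or Unsloth. The sketch only illustrates the general pattern of keeping backend-specific code behind one entry point, so that the same configuration can be run against either backend.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Protocol, Tuple


@dataclass(frozen=True)
class TrainConfig:
    """Backend-agnostic settings (hypothetical fields, for illustration only)."""
    model_name: str
    method: str = "sft"  # "sft" or an RLHF-style method label such as "dpo"
    learning_rate: float = 2e-5


class Trainer(Protocol):
    def describe(self) -> str: ...


class TRLTrainerStub:
    """Stand-in for a TRL-backed trainer; a real one would import trl here."""

    def __init__(self, config: TrainConfig) -> None:
        self.config = config

    def describe(self) -> str:
        return f"trl:{self.config.method}:{self.config.model_name}"


class UnslothTrainerStub:
    """Stand-in for an Unsloth-backed trainer; a real one would import unsloth here."""

    def __init__(self, config: TrainConfig) -> None:
        self.config = config

    def describe(self) -> str:
        return f"unsloth:{self.config.method}:{self.config.model_name}"


# The registry is the only place that knows which backends exist.
_BACKENDS: Dict[str, Callable[[TrainConfig], Trainer]] = {
    "trl": TRLTrainerStub,
    "unsloth": UnslothTrainerStub,
}


def create_trainer(backend: str, config: TrainConfig) -> Trainer:
    """Factory boundary: callers pass a backend name and never touch backend code."""
    if backend not in _BACKENDS:
        raise ValueError(f"unknown backend {backend!r}; expected one of {sorted(_BACKENDS)}")
    return _BACKENDS[backend](config)


# A reward maps (prompt, response) to a score; learned rewards would share this shape.
RewardFn = Callable[[str, str], float]


def length_reward(prompt: str, response: str) -> float:
    """Rule-based example: 1.0 if the response has at most 50 words, else 0.0."""
    return 1.0 if len(response.split()) <= 50 else 0.0


def polite_reward(prompt: str, response: str) -> float:
    """Rule-based example: 1.0 if the response contains 'thanks', else 0.0."""
    return 1.0 if "thanks" in response.lower() else 0.0


def combine_rewards(weighted: List[Tuple[RewardFn, float]]) -> RewardFn:
    """Compose several rewards into one as a weighted sum."""
    def combined(prompt: str, response: str) -> float:
        return sum(weight * fn(prompt, response) for fn, weight in weighted)
    return combined
```

With this shape, a controlled comparison amounts to calling `create_trainer("trl", cfg)` and `create_trainer("unsloth", cfg)` with the same `cfg`, so any difference in outcome is attributable to the backend rather than to diverging glue code.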

Taxonomy

Topics: Machine Learning in Materials Science · Artificial Intelligence in Healthcare and Education · Topic Modeling