TL;DR
DARE is an open framework that unifies post-training and evaluation methods for diffusion large language models, facilitating research, comparison, and deployment across various model families.
Contribution
It introduces a comprehensive, unified platform built on verl and OpenCompass for post-training, reinforcement learning, and evaluation of dLLMs, addressing fragmentation issues.
Findings
DARE enables broad algorithmic coverage across multiple dLLM models.
It provides reproducible benchmark evaluation and practical acceleration.
Empirical results demonstrate DARE's effectiveness as a research and deployment tool.
Abstract
Diffusion large language models (dLLMs) are emerging as a compelling alternative to dominant autoregressive models, replacing strictly sequential token generation with iterative denoising and parallel generation dynamics. However, their open-source ecosystem remains fragmented across model families and, in particular, across post-training pipelines, where reinforcement learning objectives, rollout implementations and evaluation scripts are often released as paper-specific codebases. This fragmentation slows research iteration, raises the engineering burden of reproduction, and makes fair comparison across algorithms difficult. We present \textbf{DARE} (\textbf{d}LLMs \textbf{A}lignment and \textbf{R}einforcement \textbf{E}xecutor), an open framework for post-training and evaluating dLLMs. Built on top of verl~\cite{sheng2024hybridflow} and OpenCompass~\cite{2023opencompass}, DARE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
