MM-ReCoder: Advancing Chart-to-Code Generation with Reinforcement Learning and Self-Correction

Zitian Tang; Xu Zhang; Jianbo Yuan; Yang Zou; Varad Gunjal; Songyao Jiang; Davide Modolo

arXiv:2604.01600·cs.AI·April 3, 2026

MM-ReCoder: Advancing Chart-to-Code Generation with Reinforcement Learning and Self-Correction

Zitian Tang, Xu Zhang, Jianbo Yuan, Yang Zou, Varad Gunjal, Songyao Jiang, Davide Modolo

PDF

TL;DR

MM-ReCoder is a novel chart-to-code generation model that leverages reinforcement learning and self-correction to produce more accurate and executable code, outperforming existing methods on multiple benchmarks.

Contribution

Introduces MM-ReCoder, a reinforcement learning-based chart-to-code model with self-correction, utilizing a two-stage multi-turn RL strategy for improved performance.

Findings

01

Achieves state-of-the-art results on three chart-to-code benchmarks.

02

Demonstrates effective self-correction and improved code accuracy.

03

Outperforms models trained solely with supervised fine-tuning.

Abstract

Multimodal Large Language Models (MLLMs) have recently demonstrated promising capabilities in multimodal coding tasks such as chart-to-code generation. However, existing methods primarily rely on supervised fine-tuning (SFT), which requires the model to learn code patterns through chart-code pairs but does not expose the model to a code execution environment. Moreover, while self-correction through execution feedback offers a potential route to improve coding quality, even state-of-the-art MLLMs have been shown to struggle with effective self-correction. In this work, we introduce MM-ReCoder, a chart-to-code generation model trained with reinforcement learning (RL) and equipped with self-correction ability. We propose a two-stage multi-turn self-correction RL strategy based on Group Relative Policy Optimization (GRPO). The first stage enhances the model's self-correction ability via…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.