JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence
Qiushi Sun, Jingyang Gong, Yang Liu, Qiaosheng Chen, Lei Li, Kai Chen, Qipeng Guo, Ben Kao, Fei Yuan

TL;DR
JanusCoder introduces a multimodal code dataset and models that generate code from visual and textual inputs, advancing visual-programmatic interfaces for code intelligence.
Contribution
The paper presents a large-scale multimodal code corpus and unified models that handle diverse visual and textual programming tasks, surpassing existing specialized approaches.
Findings
JanusCode-800K is the largest multimodal code dataset to date.
JanusCoder models achieve performance comparable to commercial models.
Extensive analysis reveals insights into harmonizing logic and visual expression.
Abstract
The scope of neural code intelligence is rapidly expanding beyond text-based source code to encompass the rich visual outputs that programs generate. This visual dimension is critical for advanced applications like flexible content generation and precise, program-driven editing of visualizations. However, progress has been impeded by the scarcity of high-quality multimodal code data, a bottleneck stemming from challenges in synthesis and quality assessment. To address these challenges, we make contributions from both a data and modeling perspective. We first introduce a complete synthesis toolkit that leverages reciprocal synergies between data modalities to efficiently produce a large-scale, high-quality corpus spanning from standard charts to complex interactive web UIs and code-driven animations. Leveraging this toolkit, we construct JanusCode-800K, the largest multimodal code corpus…
Code & Models
- 🤗 internlm/JanusCoder-8B · model · 72 dl · ♡ 13
- 🤗 internlm/JanusCoder-14B · model · 69 dl · ♡ 34
- 🤗 internlm/JanusCoderV-7B · model · 118 dl · ♡ 14
- 🤗 internlm/JanusCoderV-8B · model · 141 dl · ♡ 13
- 🤗 cyankiwi/JanusCoder-14B-AWQ-4bit · model · 2 dl
- 🤗 cyankiwi/JanusCoder-14B-AWQ-8bit · model · 2 dl
- 🤗 cyankiwi/JanusCoder-8B-AWQ-8bit · model · 1 dl
- 🤗 cyankiwi/JanusCoder-8B-AWQ-4bit · model · 1 dl
- 🤗 unsloth/JanusCoder-8B-GGUF · model · 186 dl · ♡ 3
- 🤗 unsloth/JanusCoder-8B · model · 5 dl