TL;DR
This paper introduces Diagnostic-driven Progressive Evolution (DPE), a dynamic training method for large multimodal models that iteratively diagnoses weaknesses and guides targeted data generation for continual improvement.
Contribution
The paper proposes a novel iterative training paradigm that diagnoses model weaknesses and dynamically guides data augmentation and reinforcement, improving LMM performance.
Findings
DPE achieves stable, continual gains across eleven benchmarks.
DPE enables targeted reinforcement based on diagnosed weaknesses.
Experiments on Qwen models demonstrate effectiveness of the approach.
Abstract
As Large Multimodal Models (LMMs) scale up and reinforcement learning (RL) methods mature, LMMs have made notable progress in complex reasoning and decision making. Yet training still relies on static data and fixed recipes, making it difficult to diagnose capability blind spots or provide dynamic, targeted reinforcement. Motivated by findings that test driven error exposure and feedback based correction outperform repetitive practice, we propose Diagnostic-driven Progressive Evolution (DPE), a spiral loop where diagnosis steers data generation and reinforcement, and each iteration re-diagnoses the updated model to drive the next round of targeted improvement. DPE has two key components. First, multiple agents annotate and quality control massive unlabeled multimodal data, using tools such as web search and image editing to produce diverse, realistic samples. Second, DPE attributes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗hongruijia/Qwen2.5-VL-7B-Instruct_DPE_v1model· 1 dl· ♡ 21 dl♡ 2
- 🤗hongruijia/Qwen3_VL_8B_Instruct_DPE_v1model· 1 dl· ♡ 41 dl♡ 4
- 🤗hongruijia/Qwen3_VL_8B_Instruct_DPE_v2model· 3 dl· ♡ 23 dl♡ 2
- 🤗hongruijia/Qwen2.5-VL-7B-Instruct_DPE_v2model· 2 dl· ♡ 32 dl♡ 3
- 🤗hongruijia/Qwen3_VL_8B_Instruct_DPE_v3model· 5 dl· ♡ 75 dl♡ 7
- 🤗hongruijia/Qwen2.5-VL-7B-Instruct_DPE_v3model· 7 dl· ♡ 77 dl♡ 7
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
