DERM-3R: A Resource-Efficient Multimodal Agents Framework for Dermatologic Diagnosis and Treatment in Real-World Clinical Settings

Ziwen Chen; Zhendong Wang; Chongjing Wang; Yurui Dong; Luozhijie Jin; Jihao Gu; Kui Chen; Jiaxi Yang; Bingjie Lu; Zhou Zhang; Jirui Dai; Changyong Luo; Xiameng Gai; Haibing Lan; Zhi Liu

arXiv:2604.09596·cs.AI·April 14, 2026

DERM-3R: A Resource-Efficient Multimodal Agents Framework for Dermatologic Diagnosis and Treatment in Real-World Clinical Settings

Ziwen Chen, Zhendong Wang, Chongjing Wang, Yurui Dong, Luozhijie Jin, Jihao Gu, Kui Chen, Jiaxi Yang, Bingjie Lu, Zhou Zhang, Jirui Dai, Changyong Luo, Xiameng Gai, Haibing Lan, Zhi Liu

PDF

TL;DR

DERM-3R is a resource-efficient, multimodal agent framework that models traditional Chinese medicine dermatologic diagnosis and treatment, achieving strong performance with limited data and compute resources.

Contribution

It introduces a novel multi-agent system built on a lightweight multimodal LLM, tailored for TCM dermatology, with minimal data fine-tuning, outperforming larger general models.

Findings

01

DERM-3R performs well on dermatologic reasoning tasks.

02

It matches or surpasses large general-purpose multimodal models.

03

The framework is effective with limited data and compute.

Abstract

Dermatologic diseases impose a large and growing global burden, affecting billions and substantially reducing quality of life. While modern therapies can rapidly control acute symptoms, long-term outcomes are often limited by single-target paradigms, recurrent courses, and insufficient attention to systemic comorbidities. Traditional Chinese medicine (TCM) provides a complementary holistic approach via syndrome differentiation and individualized treatment, but practice is hindered by non-standardized knowledge, incomplete multimodal records, and poor scalability of expert reasoning. We propose DERM-3R, a resource-efficient multimodal agent framework to model TCM dermatologic diagnosis and treatment under limited data and compute. Based on real-world workflows, we reformulate decision-making into three core issues: fine-grained lesion recognition, multi-view lesion representation with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.