IOSVLM: A 3D Vision-Language Model for Unified Dental Diagnosis from Intraoral Scans
Huimin Xiong, Zijie Meng, Tianxiang Hu, Chenyi Zhou, Yang Feng, Zuozhu Liu

TL;DR
IOSVLM introduces a novel 3D vision-language model for dental diagnosis from intraoral scans, leveraging native 3D geometry and a large dataset to improve accuracy and enable visual question-answering.
Contribution
The paper presents IOSVLM, the first end-to-end 3D VLM for dental diagnosis from intraoral scans, with a new dataset and geometry-to-chromatic proxy for improved 3D perception.
Findings
Achieves at least +9.58% macro accuracy over baselines.
Demonstrates effective 3D geometry modeling for dental diagnosis.
Enables visual question-answering on intraoral scans.
Abstract
3D intraoral scans (IOS) are increasingly adopted in routine dentistry due to abundant geometric evidence, and unified multi-disease diagnosis is desirable for clinical documentation and communication. While recent works introduce dental vision-language models (VLMs) to enable unified diagnosis and report generation on 2D images or multi-view images rendered from IOS, they do not fully leverage native 3D geometry. Such work is necessary and also challenging, due to: (i) heterogeneous scan forms and the complex IOS topology, (ii) multi-disease co-occurrence with class imbalance and fine-grained morphological ambiguity, (iii) limited paired 3D IOS-text data. Thus, we present IOSVLM, an end-to-end 3D VLM that represents scans as point clouds and follows a 3D encoder-projector-LLM design for unified diagnosis and generative visual question-answering (VQA), together with IOSVQA, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Face recognition and analysis
