Weakly Supervised Object Detection for Automatic Tooth-marked Tongue   Recognition

Yongcun Zhang; Jiajun Xu; Yina He; Shaozi Li; Zhiming Luo; Huangwei; Lei

arXiv:2408.16451·cs.CV·August 30, 2024

Weakly Supervised Object Detection for Automatic Tooth-marked Tongue Recognition

Yongcun Zhang, Jiajun Xu, Yina He, Shaozi Li, Zhiming Luo, Huangwei, Lei

PDF

Open Access 1 Repo

TL;DR

This paper introduces a fully automated weakly supervised method utilizing Vision Transformer and multiple instance learning for accurate detection and recognition of tooth-marked tongues in clinical images, improving objectivity and diagnostic precision in TCM.

Contribution

It presents a novel end-to-end weakly supervised approach combining ViT and MIL for tongue region detection and tooth-marked tongue recognition, with high accuracy and clinical relevance.

Findings

01

High accuracy in tooth-marked tongue classification

02

Effective localization of tooth-marked regions

03

Automated process improves diagnostic objectivity

Abstract

Tongue diagnosis in Traditional Chinese Medicine (TCM) is a crucial diagnostic method that can reflect an individual's health status. Traditional methods for identifying tooth-marked tongues are subjective and inconsistent because they rely on practitioner experience. We propose a novel fully automated Weakly Supervised method using Vision transformer and Multiple instance learning WSVM for tongue extraction and tooth-marked tongue recognition. Our approach first accurately detects and extracts the tongue region from clinical images, removing any irrelevant background information. Then, we implement an end-to-end weakly supervised object detection method. We utilize Vision Transformer (ViT) to process tongue images in patches and employ multiple instance loss to identify tooth-marked regions with only image-level annotations. WSVM achieves high accuracy in tooth-marked tongue…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yc-zh/wsvm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraditional Chinese Medicine Studies

MethodsLinear Layer · Adam · Layer Normalization · Attention Is All You Need · Position-Wise Feed-Forward Layer · Dense Connections · Residual Connection · Multi-Head Attention · Byte Pair Encoding · Absolute Position Encodings