MMCL: Correcting Content Query Distributions for Improved Anti-Overlapping X-Ray Object Detection

Mingyuan Li; Tong Jia; Hui Lu; Hao Wang; Bowen Ma; Shiyi Guo; Shuyang Lin; Dongyue Chen; Haoran Wang; Baosheng Yu

arXiv:2406.03176 · cs.CV · November 12, 2025 · 1 citation

TL;DR

This paper introduces MMCL, a contrastive learning framework that improves X-ray object detection by balancing content query distributions, leading to better separation of overlapping objects and state-of-the-art results.

Contribution

The paper proposes a novel multi-class min-margin contrastive learning method to correct content query distributions for enhanced anti-overlapping object detection in X-ray images.

Findings

01 MMCL improves detection accuracy on three X-ray datasets.

02 The method achieves state-of-the-art performance across multiple backbone networks.

03 Enhanced intra-class diversity and inter-class separability are demonstrated.

Abstract

Unlike natural images with occlusion-based overlap, X-ray images exhibit depth-induced superimposition and semi-transparent appearances, where objects at different depths overlap and their features blend together. These characteristics demand specialized mechanisms to disentangle mixed representations between target objects (e.g., prohibited items) and irrelevant backgrounds. While recent studies have explored adapting detection transformers (DETR) for anti-overlapping object detection, the importance of well-distributed content queries that represent object hypotheses remains underexplored. In this paper, we introduce a multi-class min-margin contrastive learning (MMCL) framework to correct the distribution of content queries, achieving balanced intra-class diversity and inter-class separability. The framework first groups content queries by object category and then applies two…
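
The abstract is cut off before it describes the two mechanisms, so the following is only a rough sketch of the stated goal (grouping content queries by category, then balancing intra-class diversity against inter-class separability), not the paper's actual losses. Everything in it is an assumption made for illustration: the equal-sized contiguous grouping, the cosine-similarity hinge terms, the margin values, and the hypothetical names `group_queries` and `hinge_terms`.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two non-zero vectors given as lists."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def group_queries(queries, num_classes):
    """Split queries into equal-sized contiguous groups, one per class.

    Assumption: the number of queries is a multiple of num_classes.
    """
    per_class, remainder = divmod(len(queries), num_classes)
    if remainder:
        raise ValueError("number of queries must be divisible by num_classes")
    return [queries[c * per_class:(c + 1) * per_class]
            for c in range(num_classes)]


def hinge_terms(groups, intra_margin=0.5, inter_margin=0.0):
    """Return (intra, inter) hinge penalties on cosine similarity.

    intra: penalizes same-class pairs more similar than intra_margin,
           so queries of one class do not collapse onto each other.
    inter: penalizes cross-class pairs more similar than inter_margin,
           so different classes stay separated.
    Both forms are illustrative assumptions, not the paper's definitions.
    """
    intra, inter = [], []
    for group in groups:
        for u, v in combinations(group, 2):
            intra.append(max(0.0, cosine(u, v) - intra_margin))
    for group_a, group_b in combinations(groups, 2):
        for u in group_a:
            for v in group_b:
                inter.append(max(0.0, cosine(u, v) - inter_margin))

    def mean(values):
        return sum(values) / len(values) if values else 0.0

    return mean(intra), mean(inter)


if __name__ == "__main__":
    # Four toy 2-D "content queries", two per class.
    toy_queries = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
    print(hinge_terms(group_queries(toy_queries, num_classes=2)))
```

In this toy setup, identical queries within a class incur an intra-class penalty while orthogonal classes incur no inter-class penalty, which is the kind of trade-off the abstract describes as balancing diversity and separability.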

Code & Models

Repositories

anonymity0403/mmcl (PyTorch, official)

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Speech Recognition and Synthesis · Topic Modeling

Methods: Softmax · Layer Normalization · Linear Layer · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Label Smoothing · Adam · Attention Is All You Need · Residual Connection · Multi-Head Attention