X-ray illicit object detection using hybrid CNN-transformer neural network architectures
Jorgen Cani, Christos Diou, Spyridon Evangelatos, Panagiotis, Radoglou-Grammatikis, Vasileios Argyriou, Panagiotis Sarigiannidis, Iraklis, Varlamis, Georgios Th. Papadopoulos

TL;DR
This study evaluates hybrid CNN-transformer neural network architectures for X-ray object detection, demonstrating increased robustness over traditional CNNs in challenging, occluded, and domain-shifted scenarios.
Contribution
It introduces and compares hybrid CNN-transformer architectures with CNN baselines for X-ray object detection across multiple datasets.
Findings
Hybrid architectures show increased robustness under domain shifts.
YOLOv8 with CNN backbone performs best on some datasets.
Hybrid models outperform CNNs in occlusion and concealment scenarios.
Abstract
In the field of X-ray security applications, even the smallest details can significantly impact outcomes. Objects that are heavily occluded or intentionally concealed pose a great challenge for detection, whether by human observation or through advanced technological applications. While certain Deep Learning (DL) architectures demonstrate strong performance in processing local information, such as Convolutional Neural Networks (CNNs), others excel in handling distant information, e.g., transformers. In X-ray security imaging the literature has been dominated by the use of CNN-based methods, while the integration of the two aforementioned leading architectures has not been sufficiently explored. In this paper, various hybrid CNN-transformer architectures are evaluated against a common CNN object detection baseline, namely YOLOv8. In particular, a CNN (HGNetV2) and a hybrid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsYou Only Look Once
