Multimodal and Hyperspectral Dataset for Segmentation of Bulky Waste using VIS, IR, NIR, and Terahertz Imaging
Manuel Bihler, Lukas Roming, Dovilė Čibiraitė-Lukenskienė, Jochen Aderhold, Andreas Keil, Friedrich Schlüter, Robin Gruna, Michael Heizmann

TL;DR
This paper introduces a comprehensive dataset combining multiple imaging techniques to improve the classification and segmentation of bulky waste, especially distinguishing wood from non-wood materials.
Contribution
The novel contribution is a publicly available multimodal dataset with VIS, IR, NIR, and THz imaging for waste segmentation, including detailed annotations and benchmark tasks.
Findings
The dataset includes 56 registered scenes with 22,659 annotated patches for binary and subclass segmentation tasks.
Baseline performance using CNNs and fusion architectures is reported to establish reference metrics for future work.
The dataset includes challenging scenarios like occlusions and embedded metals to encourage robust multimodal approaches.
Abstract
This study presents an annotated multi-sensor, multimodal, and hyperspectral dataset designed to support deep learning-based classification and segmentation of bulky waste. The dataset comprises four distinct sensor modalities: high-resolution visible RGB images (VIS), hyperspectral near-infrared (NIR), temporally resolved thermal infrared (IR), and terahertz (THz) imaging with depth information, providing complementary multimodal information. An image registration process aligns all modalities to a common reference frame, enabling near pixel-precise fusion across sensors. WoodVIT contains 56 registered multi-sensor scenes, partitioned into 22,659 annotated patches with two main classes (wood and non-wood) and 16 subclass labels. It includes pixel-masks and patch-wise annotations to facilitate both segmentation and classification tasks. The primary benchmark task is binary…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Thermography and Photoacoustic Techniques · Remote-Sensing Image Classification
