TL;DR
This paper advances AI waste detection by benchmarking models, optimizing prompts, fine-tuning detectors, and proposing a semi-supervised ensemble pseudo-labeling method that improves performance on real-world datasets.
Contribution
It establishes strong baselines, introduces an ensemble pseudo-labeling framework, and generates high-quality annotations for waste detection, addressing dataset limitations and domain-specific challenges.
Findings
Transformer-based detectors achieved 51.6 mAP.
Prompt optimization significantly improves zero-shot accuracy.
Pseudo-labeling surpasses fully supervised training results.
Abstract
Effective waste sorting is critical for sustainable recycling, yet AI research in this domain continues to lag behind commercial systems due to limited datasets and reliance on legacy object detectors. In this work, we advance AI-driven waste detection by establishing strong baselines and introducing an ensemble-based semi-supervised learning framework. We first benchmark state-of-the-art Open-Vocabulary Object Detection (OVOD) models on the real-world ZeroWaste dataset, demonstrating that while class-only prompts perform poorly, LLM-optimized prompts significantly enhance zero-shot accuracy. Next, to address domain-specific limitations, we fine-tune modern transformer-based detectors, achieving a new baseline of 51.6 mAP. We then propose a soft pseudo-labeling strategy that fuses ensemble predictions using spatial and consensus-aware weighting, enabling robust semi-supervised training.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
