ALET (Automated Labeling of Equipment and Tools): A Dataset, a Baseline and a Usecase for Tool Detection in the Wild
Fatih Can Kurnaz, Burak Hocao\u{g}lu, Mert Kaan Y{\i}lmaz, \.Idil, S\"ulo, and Sinan Kalkan (KOVAN Research Lab, Dept. of Computer Engineering,, Middle East Technical University, Ankara, Turkey)

TL;DR
This paper introduces METU-ALET, a comprehensive dataset for tool detection in real-world environments, evaluates several state-of-the-art detectors, and highlights challenges in detecting small or similar-looking tools, supporting future robotics research.
Contribution
Provides the first extensive dataset for tool detection in diverse real-world environments and benchmarks multiple deep detectors on this dataset.
Findings
Detectors struggle with small and visually similar tools.
The dataset reveals challenges like occlusion and articulation.
Baseline models provide a starting point for future research.
Abstract
Robots collaborating with humans in realistic environments will need to be able to detect the tools that can be used and manipulated. However, there is no available dataset or study that addresses this challenge in real settings. In this paper, we fill this gap by providing an extensive dataset (METU-ALET) for detecting farming, gardening, office, stonemasonry, vehicle, woodworking and workshop tools. The scenes correspond to sophisticated environments with or without humans using the tools. The scenes we consider introduce several challenges for object detection, including the small scale of the tools, their articulated nature, occlusion, inter-class invariance, etc. Moreover, we train and compare several state of the art deep object detectors (including Faster R-CNN, Cascade R-CNN, RepPoint and RetinaNet) on our dataset. We observe that the detectors have difficulty in detecting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization
MethodsRegion Proposal Network · Convolution · RoIPool · Softmax · Faster R-CNN
