Development of Image Collection Method Using YOLO and Siamese Network
Chan Young Shin, Ah Hyun Lee, Jun Young Lee, Ji Min Lee, Soo Jin Park

TL;DR
This paper presents a novel image collection method combining YOLO and Siamese networks to improve filtering accuracy and resource efficiency in automated data collection.
Contribution
The authors introduce a combined YOLO and Siamese network approach with adjustable thresholds, enhancing filtering accuracy and reducing resource usage in automated image data collection.
Findings
YOLO+Siamese network outperforms YOLO+MobileNet in F1 score (0.772 vs. 0.678).
Using cropped images improves Siamese network performance (82.31 vs. 80.94).
The method enables faster, higher-quality data collection with fewer resources.
Abstract
As we enter the era of big data, collecting high-quality data is very important. However, collecting data by humans is not only very time-consuming but also expensive. Therefore, many scientists have devised various methods to collect data using computers. Among them, there is a method called web crawling, but the authors found that the crawling method has a problem in that unintended data is collected along with the user. The authors found that this can be filtered using the object recognition model YOLOv10. However, there are cases where data that is not properly filtered remains. Here, image reclassification was performed by additionally utilizing the distance output from the Siamese network, and higher performance was recorded than other classification models. (average \_f1 score YOLO+MobileNet 0.678->YOLO+SiameseNet 0.772)) The user can specify a distance threshold to adjust the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification
MethodsSiamese Network
