MOD-CL: Multi-label Object Detection with Constrained Loss

Sota Moriyama; Koji Watanabe; Katsumi Inoue; Akihiro Takemura

arXiv:2403.07885·cs.CV·March 14, 2024·1 cites

MOD-CL: Multi-label Object Detection with Constrained Loss

Sota Moriyama, Koji Watanabe, Katsumi Inoue, Akihiro Takemura

PDF

Open Access 1 Repo

TL;DR

This paper presents MOD-CL, a multi-label object detection framework that employs constrained loss functions to improve output adherence to specified requirements, building upon YOLOv8 and introducing new post-processing models.

Contribution

The paper introduces MOD-CL with constrained loss integration and new post-processing models, enhancing multi-label object detection accuracy and output conformity.

Findings

01

Improved detection scores with constrained loss implementation.

02

Effective post-processing models for constrained outputs.

03

Enhanced multi-label detection performance.

Abstract

We introduce MOD-CL, a multi-label object detection framework that utilizes constrained loss in the training process to produce outputs that better satisfy the given requirements. In this paper, we use $MO D_{YOLO}$ , a multi-label object detection model built upon the state-of-the-art object detection model YOLOv8, which has been published in recent years. In Task 1, we introduce the Corrector Model and Blender Model, two new models that follow after the object detection process, aiming to generate a more constrained output. For Task 2, constrained losses have been incorporated into the $MO D_{YOLO}$ architecture using Product T-Norm. The results show that these implementations are instrumental to improving the scores for both Task 1 and Task 2.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sotam2369/MOD-CL
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Text and Document Classification Technologies

MethodsYou Only Look Once · RoIAlign · Softmax · RoIPool