Boosting Mask R-CNN Performance for Long, Thin Forensic Traces with Pre-Segmentation and IoU Region Merging
Moritz Zink, Martin Schiele, Pengcheng Fan, Stephan Gasterst\"adt

TL;DR
This paper enhances Mask R-CNN's ability to segment long, thin forensic traces by integrating pre-segmentation with PSPNet and novel training strategies for better accuracy and robustness.
Contribution
It introduces a pre-segmentation approach with PSPNet and custom training heuristics to improve Mask R-CNN's performance on challenging forensic traces.
Findings
Significant improvement in segmentation accuracy for long, thin objects.
Enhanced robustness and generalization across diverse forensic images.
Development of specialized cost functions and heuristics for training.
Abstract
Mask R-CNN has recently achieved great success in the field of instance segmentation. However, weaknesses of the algorithm have been repeatedly pointed out as well, especially in the segmentation of long, sparse objects whose orientation is not exclusively horizontal or vertical. We present here an approach that significantly improves the performance of the algorithm by first pre-segmenting the images with a PSPNet algorithm. To further improve its prediction, we have developed our own cost functions and heuristics in the form of training strategies, which can prevent so-called (early) overfitting and achieve a more targeted convergence. Furthermore, due to the high variance of the images, especially for PSPNet, we aimed to develop strategies for a high robustness and generalization, which are also presented here.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeophysical Methods and Applications · Advanced Neural Network Applications · Digital Media Forensic Detection
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Auxiliary Classifier · Dilated Convolution · Batch Normalization · Average Pooling · Convolution · Pyramid Pooling Module · PSPNet
