Extract and Merge: Merging extracted humans from different images utilizing Mask R-CNN
Asati Minkesh, Kraisittipong Worranitta, Miyachi Taizo

TL;DR
This paper presents an efficient Mask R-CNN based application that automatically extracts human figures from multiple images or videos and merges them into new scenes, streamlining photo editing tasks.
Contribution
It introduces a fast, scalable method for extracting and merging human instances from images and videos without additional overhead, enhancing photo editing automation.
Findings
Runs at 5 frames per second for real-time performance.
Handles multiple images and videos of varying lengths.
Enables automatic grouping and background merging of human figures.
Abstract
Selecting human objects out of the various type of objects in images and merging them with other scenes is manual and day-to-day work for photo editors. Although recently Adobe photoshop released "select subject" tool which automatically selects the foreground object in an image, but still requires fine manual tweaking separately. In this work, we proposed an application utilizing Mask R-CNN (for object detection and mask segmentation) that can extract human instances from multiple images and merge them with a new background. This application does not add any overhead to Mask R-CNN, running at 5 frames per second. It can extract human instances from any number of images or videos from merging them together. We also structured the code to accept videos of different lengths as input and length of the output-video will be equal to the longest input-video. We wanted to create a simple yet…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Face recognition and analysis
MethodsRegion Proposal Network · Softmax · Convolution · RoIAlign · Mask R-CNN
