MONA: Moving Object Detection from Videos Shot by Dynamic Camera

Boxun Hu; Mingze Xia; Ding Zhao; Guanlin Wu

arXiv:2501.13183·cs.CV·January 24, 2025

MONA: Moving Object Detection from Videos Shot by Dynamic Camera

Boxun Hu, Mingze Xia, Ding Zhao, Guanlin Wu

PDF

Open Access

TL;DR

MONA is a new framework that improves moving object detection and segmentation in videos captured by dynamic cameras, addressing challenges posed by camera and object motion in urban environments.

Contribution

MONA introduces a novel two-module approach combining dynamic points extraction and adaptive segmentation, advancing moving object detection in videos with moving cameras.

Findings

01

Achieves state-of-the-art results on MPI Sintel dataset.

02

Effectively distinguishes camera motion from object motion.

03

Enhances urban environment analysis applications.

Abstract

Dynamic urban environments, characterized by moving cameras and objects, pose significant challenges for camera trajectory estimation by complicating the distinction between camera-induced and object motion. We introduce MONA, a novel framework designed for robust moving object detection and segmentation from videos shot by dynamic cameras. MONA comprises two key modules: Dynamic Points Extraction, which leverages optical flow and tracking any point to identify dynamic points, and Moving Object Segmentation, which employs adaptive bounding box filtering, and the Segment Anything for precise moving object segmentation. We validate MONA by integrating with the camera trajectory estimation method LEAP-VO, and it achieves state-of-the-art results on the MPI Sintel dataset comparing to existing methods. These results demonstrate MONA's effectiveness for moving object detection and its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications