SimVODIS: Simultaneous Visual Odometry, Object Detection, and Instance Segmentation
Ue-Hwan Kim, Se-Ho Kim, Jong-Hwan Kim

TL;DR
SimVODIS is a neural architecture that performs visual odometry, object detection, and instance segmentation simultaneously in a single thread, improving efficiency and performance for intelligent agents.
Contribution
It introduces a unified neural model that integrates geometric and semantic perception tasks using self-supervised learning from unlabeled videos.
Findings
Outperforms or matches state-of-the-art in pose estimation and depth prediction.
Achieves high accuracy in object detection and instance segmentation.
Operates efficiently in a single thread, reducing computational complexity.
Abstract
Intelligent agents need to understand the surrounding environment to provide meaningful services to or interact intelligently with humans. The agents should perceive geometric features as well as semantic entities inherent in the environment. Contemporary methods in general provide one type of information regarding the environment at a time, making it difficult to conduct high-level tasks. Moreover, running two types of methods and associating two resultant information requires a lot of computation and complicates the software architecture. To overcome these limitations, we propose a neural architecture that simultaneously performs both geometric and semantic tasks in a single thread: simultaneous visual odometry, object detection, and instance segmentation (SimVODIS). Training SimVODIS requires unlabeled video sequences and the photometric consistency between input image frames…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Neural Network Applications
