Spatial Memory for Context Reasoning in Object Detection
Xinlei Chen, Abhinav Gupta

TL;DR
This paper introduces a Spatial Memory Network (SMN) that enhances object detection by modeling instance-level spatial context and object-object relationships, leading to improved detection accuracy.
Contribution
The paper proposes a novel Spatial Memory Network that assembles object instances into a pseudo-image for better spatial reasoning in object detection.
Findings
Achieved 2.2% improvement over baseline Faster R-CNN on COCO.
Effectively models spatial layout for object-object relationship reasoning.
Provides a new sequential reasoning architecture for object detection.
Abstract
Modeling instance-level context and object-object relationships is extremely challenging. It requires reasoning about bounding boxes of different classes, locations \etc. Above all, instance-level spatial reasoning inherently requires modeling conditional distributions on previous detections. Unfortunately, our current object detection systems do not have any {\bf memory} to remember what to condition on! The state-of-the-art object detectors still detect all object in parallel followed by non-maximal suppression (NMS). While memory has been used for tasks such as captioning, they mostly use image-level memory cells without capturing the spatial layout. On the other hand, modeling object-object relationships requires {\bf spatial} reasoning -- not only do we need a memory to store the spatial layout, but also a effective reasoning module to extract spatial patterns. This paper presents…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Robotics and Sensor-Based Localization
MethodsMemory Network
