Using Motion and Internal Supervision in Object Recognition
Daniel Harari

TL;DR
This thesis investigates how motion information and internal supervision can enhance unsupervised object recognition in dynamic scenes, focusing on adaptive detection, hand recognition, and gaze direction extraction.
Contribution
It introduces novel methods leveraging motion for adaptive object-part detection and unsupervised learning of hands and gaze direction, inspired by infant learning processes.
Findings
Motion aids adaptive object-part detection in dynamic scenes.
Unsupervised methods successfully recognize hands by appearance and context.
Gaze direction can be extracted using motion cues in unsupervised learning.
Abstract
In this thesis we address two related aspects of visual object recognition: the use of motion information, and the use of internal supervision, to help unsupervised learning. These two aspects are inter-related in the current study, since image motion is used for internal supervision, via the detection of spatiotemporal events of active-motion and the use of tracking. Most current work in object recognition deals with static images during both learning and recognition. In contrast, we are interested in a dynamic scene where visual processes, such as detecting motion events and tracking, contribute spatiotemporal information, which is useful for object attention, motion segmentation, 3-D understanding and object interactions. We explore the use of these sources of information in both learning and recognition processes. In the first part of the work, we demonstrate how motion can be used…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization · Advanced Vision and Imaging
