Neuromorphic Vision-based Motion Segmentation with Graph Transformer   Neural Network

Yusra Alkendi; Rana Azzam; Sajid Javed; Lakmal Seneviratne; and Yahya; Zweiri

arXiv:2404.10940·cs.CV·April 18, 2024·1 cites

Neuromorphic Vision-based Motion Segmentation with Graph Transformer Neural Network

Yusra Alkendi, Rana Azzam, Sajid Javed, Lakmal Seneviratne, and Yahya, Zweiri

PDF

Open Access

TL;DR

This paper introduces GTNN, a novel Graph Transformer Neural Network for event-based motion segmentation that effectively captures spatiotemporal correlations in neuromorphic vision data, outperforming existing methods in dynamic environments.

Contribution

The work presents a new event-based motion segmentation algorithm using a Graph Transformer, along with a novel labeling scheme and a public dataset for benchmarking.

Findings

01

GTNN outperforms state-of-the-art methods in accuracy and detection rate.

02

The proposed training scheme enables efficient learning on large datasets.

03

GTNN maintains robustness in complex dynamic scenes with multiple moving objects.

Abstract

Moving object segmentation is critical to interpret scene dynamics for robotic navigation systems in challenging environments. Neuromorphic vision sensors are tailored for motion perception due to their asynchronous nature, high temporal resolution, and reduced power consumption. However, their unconventional output requires novel perception paradigms to leverage their spatially sparse and temporally dense nature. In this work, we propose a novel event-based motion segmentation algorithm using a Graph Transformer Neural Network, dubbed GTNN. Our proposed algorithm processes event streams as 3D graphs by a series of nonlinear transformations to unveil local and global spatiotemporal correlations between events. Based on these correlations, events belonging to moving objects are segmented from the background without prior knowledge of the dynamic scene geometry. The algorithm is trained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Medical Image Segmentation Techniques · Image and Video Stabilization

MethodsAttention Is All You Need · Laplacian EigenMap · Laplacian Positional Encodings · Dropout · Adam · Position-Wise Feed-Forward Layer · Linear Layer · Layer Normalization · Byte Pair Encoding · Absolute Position Encodings