Complex Sequential Understanding through the Awareness of Spatial and   Temporal Concepts

Bo Pang; Kaiwen Zha; Hanwen Cao; Jiajun Tang; Minghui Yu; Cewu Lu

arXiv:2006.00212·cs.CV·June 2, 2020

Complex Sequential Understanding through the Awareness of Spatial and Temporal Concepts

Bo Pang, Kaiwen Zha, Hanwen Cao, Jiajun Tang, Minghui Yu, Cewu Lu

PDF

1 Repo

TL;DR

This paper introduces the Semi-Coupled Structure, a neural network approach that separately learns spatial and temporal concepts, enhancing performance on large-scale sequential tasks like image annotation, video recognition, and radar prediction.

Contribution

The paper proposes a novel Semi-Coupled Structure that explicitly decouples spatial and temporal learning in neural networks, improving their ability to handle complex sequential data.

Findings

01

Successfully annotates object outlines in images sequentially

02

Performs effective video action recognition

03

Predicts future radar images from observed sequences

Abstract

Understanding sequential information is a fundamental task for artificial intelligence. Current neural networks attempt to learn spatial and temporal information as a whole, limited their abilities to represent large scale spatial representations over long-range sequences. Here, we introduce a new modeling strategy called Semi-Coupled Structure (SCS), which consists of deep neural networks that decouple the complex spatial and temporal concepts learning. Semi-Coupled Structure can learn to implicitly separate input information into independent parts and process these parts respectively. Experiments demonstrate that a Semi-Coupled Structure can successfully annotate the outline of an object in images sequentially and perform video action recognition. For sequence-to-sequence problems, a Semi-Coupled Structure can predict future meteorological radar echo images based on observed images.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

BoPang1996/Semi-Coupled-Structure-for-visual-sequental-tasks
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.