Structure-Regularized Attention for Deformable Object Representation

Shenao Zhang; Li Shen; Zhifeng Li; Wei Liu

arXiv:2106.06672·cs.CV·June 15, 2021

Structure-Regularized Attention for Deformable Object Representation

Shenao Zhang, Li Shen, Zhifeng Li, Wei Liu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a structure-regularized attention mechanism that enhances deformable object representation by modeling intrinsic structural dependencies, improving performance and efficiency in neural networks.

Contribution

It proposes a novel structure-regularized attention method that formalizes feature interactions via structural factorization, seamlessly integrating into CNNs for better deformable object modeling.

Findings

01

Improves performance across multiple tasks.

02

Reduces model complexity compared to existing attention methods.

03

Captures diversified object part representations without extra supervision.

Abstract

Capturing contextual dependencies has proven useful to improve the representational power of deep neural networks. Recent approaches that focus on modeling global context, such as self-attention and non-local operation, achieve this goal by enabling unconstrained pairwise interactions between elements. In this work, we consider learning representations for deformable objects which can benefit from context exploitation by modeling the structural dependencies that the data intrinsically possesses. To this end, we provide a novel structure-regularized attention mechanism, which formalizes feature interaction as structural factorization through the use of a pair of light-weight operations. The instantiated building blocks can be directly incorporated into modern convolutional neural networks, to boost the representational power in an efficient manner. Comprehensive studies on multiple tasks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shenao-zhang/StRA
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Face recognition and analysis · Human Pose and Action Recognition