Feature-Based Lie Group Transformer for Real-World Applications

Takayuki Komatsu; Yoshiyuki Ohmura; Kayato Nishitsunoi; and Yasuo Kuniyoshi

arXiv:2506.04668·cs.CV·June 10, 2025

Feature-Based Lie Group Transformer for Real-World Applications

Takayuki Komatsu, Yoshiyuki Ohmura, Kayato Nishitsunoi, and Yasuo Kuniyoshi

PDF

Open Access

TL;DR

This paper introduces a feature-based Lie group transformer that combines group decomposition with feature extraction and segmentation to improve real-world object recognition, addressing limitations of previous low-resolution, background-free models.

Contribution

It extends previous group decomposition methods by integrating feature extraction and segmentation, enabling application to complex real-world images with backgrounds.

Findings

01

Validated on real-world dataset with objects and backgrounds

02

Demonstrated improved object recognition capabilities

03

Bridged gap between theoretical group methods and practical applications

Abstract

The main goal of representation learning is to acquire meaningful representations from real-world sensory inputs without supervision. Representation learning explains some aspects of human development. Various neural network (NN) models have been proposed that acquire empirically good representations. However, the formulation of a good representation has not been established. We recently proposed a method for categorizing changes between a pair of sensory inputs. A unique feature of this approach is that transformations between two sensory inputs are learned to satisfy algebraic structural constraints. Conventional representation learning often assumes that disentangled independent feature axes is a good representation; however, we found that such a representation cannot account for conditional independence. To overcome this problem, we proposed a new method using group decomposition in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopological and Geometric Data Analysis · Ferroelectric and Negative Capacitance Devices · Visual Attention and Saliency Detection