Feature-Based Lie Group Transformer for Real-World Applications
Takayuki Komatsu, Yoshiyuki Ohmura, Kayato Nishitsunoi, and Yasuo Kuniyoshi

TL;DR
This paper introduces a feature-based Lie group transformer that combines group decomposition with feature extraction and segmentation to improve real-world object recognition, addressing limitations of previous low-resolution, background-free models.
Contribution
It extends previous group decomposition methods by integrating feature extraction and segmentation, enabling application to complex real-world images with backgrounds.
Findings
Validated on real-world dataset with objects and backgrounds
Demonstrated improved object recognition capabilities
Bridged gap between theoretical group methods and practical applications
Abstract
The main goal of representation learning is to acquire meaningful representations from real-world sensory inputs without supervision. Representation learning explains some aspects of human development. Various neural network (NN) models have been proposed that acquire empirically good representations. However, the formulation of a good representation has not been established. We recently proposed a method for categorizing changes between a pair of sensory inputs. A unique feature of this approach is that transformations between two sensory inputs are learned to satisfy algebraic structural constraints. Conventional representation learning often assumes that disentangled independent feature axes is a good representation; however, we found that such a representation cannot account for conditional independence. To overcome this problem, we proposed a new method using group decomposition in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Ferroelectric and Negative Capacitance Devices · Visual Attention and Saliency Detection
