# Transfer Representation Learning with TSK Fuzzy System

**Authors:** Peng Xu, Zhaohong Deng, Jun Wang, Qun Zhang, Shitong Wang

arXiv: 1901.02703 · 2019-01-10

## TL;DR

This paper introduces a transfer learning method using TSK fuzzy systems to create an interpretable, effective feature transformation that minimizes distribution differences between domains without relying on complex kernel functions.

## Contribution

The paper proposes a novel transfer representation learning approach with TSK fuzzy systems, enhancing interpretability and effectiveness over kernel-based methods.

## Key findings

- Outperforms kernel-based methods on text and image datasets.
- Provides more interpretable transfer learning models.
- Achieves better domain distribution alignment.

## Abstract

Transfer learning can address the learning tasks of unlabeled data in the target domain by leveraging plenty of labeled data from a different but related source domain. A core issue in transfer learning is to learn a shared feature space in where the distributions of the data from two domains are matched. This learning process can be named as transfer representation learning (TRL). The feature transformation methods are crucial to ensure the success of TRL. The most commonly used feature transformation method in TRL is kernel-based nonlinear mapping to the high-dimensional space followed by linear dimensionality reduction. But the kernel functions are lack of interpretability and are difficult to be selected. To this end, the TSK fuzzy system (TSK-FS) is combined with transfer learning and a more intuitive and interpretable modeling method, called transfer representation learning with TSK-FS (TRL-TSK-FS) is proposed in this paper. Specifically, TRL-TSK-FS realizes TRL from two aspects. On one hand, the data in the source and target domains are transformed into the fuzzy feature space in which the distribution distance of the data between two domains is min-imized. On the other hand, discriminant information and geo-metric properties of the data are preserved by linear discriminant analysis and principal component analysis. In addition, another advantage arises with the proposed method, that is, the nonlinear transformation is realized by constructing fuzzy mapping with the antecedent part of the TSK-FS instead of kernel functions which are difficult to be selected. Extensive experiments are conducted on the text and image datasets. The results obviously show the superiority of the proposed method.

---
Source: https://tomesphere.com/paper/1901.02703