Unsupervised Generative Feature Transformation via Graph Contrastive Pre-training and Multi-objective Fine-tuning
Wangyang Ying, Dongjie Wang, Xuanming Hu, Yuanchun Zhou, Charu C., Aggarwal, Yanjie Fu

TL;DR
This paper introduces an unsupervised learning framework that combines graph contrastive pretraining and multi-objective fine-tuning to transform features effectively without labeled data, capturing complex interactions for improved data analysis.
Contribution
It proposes a novel unsupervised paradigm for feature transformation that integrates graph contrastive learning, generative modeling, and a new utility measurement, addressing limitations of prior methods.
Findings
Developed a feature value consistency metric for utility evaluation.
Created a graph contrastive encoder for feature set representation.
Designed a deep generative model for feature transformation.
Abstract
Feature transformation is to derive a new feature set from original features to augment the AI power of data. In many science domains such as material performance screening, while feature transformation can model material formula interactions and compositions and discover performance drivers, supervised labels are collected from expensive and lengthy experiments. This issue motivates an Unsupervised Feature Transformation Learning (UFTL) problem. Prior literature, such as manual transformation, supervised feedback guided search, and PCA, either relies on domain knowledge or expensive supervised feedback, or suffers from large search space, or overlooks non-linear feature-feature interactions. UFTL imposes a major challenge on existing methods: how to design a new unsupervised paradigm that captures complex feature interactions and avoids large search space? To fill this gap, we connect…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Text and Document Classification Technologies
MethodsSparse Evolutionary Training · Principal Components Analysis · Contrastive Learning
