Understanding Concept Identification as Consistent Data Clustering Across Multiple Feature Spaces
Felix Lanfermann, Sebastian Schmitt, Patricia Wollstadt

TL;DR
This paper presents a novel approach to concept identification in large datasets, emphasizing consistency across multiple feature subsets, and demonstrates its advantages over traditional clustering methods in interpretability and decision support.
Contribution
It introduces a new perspective on concept identification as a specialized clustering task and proposes the use of mutual information to evaluate cluster consistency across feature subsets.
Findings
Identified clusters are more interpretable in energy management data.
Proposed method outperforms classical clustering in consistency across feature subsets.
Mutual information effectively measures cluster stability across features.
Abstract
Identifying meaningful concepts in large data sets can provide valuable insights into engineering design problems. Concept identification aims at identifying non-overlapping groups of design instances that are similar in a joint space of all features, but which are also similar when considering only subsets of features. These subsets usually comprise features that characterize a design with respect to one specific context, for example, constructive design parameters, performance values, or operation modes. It is desirable to evaluate the quality of design concepts by considering several of these feature subsets in isolation. In particular, meaningful concepts should not only identify dense, well separated groups of data instances, but also provide non-overlapping groups of data that persist when considering pre-defined feature subsets separately. In this work, we propose to view concept…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Criteria Decision Making · Advanced Multi-Objective Optimization Algorithms
