Vector-based categorization analysis with improved principal component generated reference
Joseph M. Hamill, Matthias Arenz

TL;DR
This paper introduces an improved vector-based classification method using principal components to better identify clusters in large, variable datasets, demonstrated on molecular junction measurements.
Contribution
It proposes using the first principal component of the correlation matrix as a reference trace for enhanced clustering in vector analysis.
Findings
Principal component-based reference trace reveals two distinct groups in data.
Method improves cluster separation compared to traditional blank tunneling reference.
Supports objective hypothesis formulation in large data set analysis.
Abstract
With automated measurements, large data sets can be assembled with relative ease. Often these data sets have a large degree of variance. Nonetheless, we hope to find groups within the data set with shared distinguishing characteristics. Recently Albrecht, et al demonstrated a multi-parameter vector-based classification analysis which provided a powerful means to find clusters within a data set of conductance versus displacement traces measured from single molecular break junctions (Lemmer, et al, Nature Communications, 2016, 7, 12922). An idealized blank tunneling trace was used as a reference to calculate three vector variables. The authors suggested a more appropriate reference can be chosen to better form clusters in the data. Here we propose using a principal component of the correlation matrix calculated from the data set to construct a fast and strategic reference trace. Principal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMolecular Junctions and Nanostructures · Electrochemical Analysis and Applications · Electrochemical sensors and biosensors
