# Fast data sorting with modified principal component analysis to   distinguish unique single molecular break junction trajectories

**Authors:** J.M. Hamill, X.T. Zhao, G. M\'esz\'aros, M.R. Bryce, and M. Arenz

arXiv: 1705.06161 · 2018-01-10

## TL;DR

This paper introduces a rapid, objective PCA-based method for sorting large multivariate datasets, demonstrated on molecular conductance data to distinguish different molecular configurations and types.

## Contribution

The paper presents a novel PCA-based analysis technique that automates data sorting without prior hypotheses, improving speed and objectivity over existing correlation matrix methods.

## Key findings

- Successfully sorted molecular conductance data into distinct groups
- Automatically distinguished between different molecular types and geometries
- Applicable to various multivariate datasets

## Abstract

A simple and fast analysis method to sort large data sets into groups with shared distinguishing characteristics is described, and applied to single molecular break junction conductance versus electrode displacement data. The method, based on principal component analysis, successfully sorted data sets based on the projection of the data onto the first or second principal component of the correlation matrix without the need to assert any specific hypothesis about the expected features within the data. This was an improvement on the current correlation matrix analysis approach because it sorted data automatically, making it more objective and less time consuming, and our method is applicable to a wide range of multivariate data sets. Here the method was demonstrated on two systems. First, it was demonstrated on mixtures of two molecules with identical anchor groups, similar lengths, but either a $\pi$ (high conductance) or $\sigma$ (low conductance) bridge. The mixed data was automatically sorted into two groups containing one molecule or the other. Second, it was demonstrated on break junction data measured with the $\pi$ bridged molecule alone. Again the method distinguished between two groups. These groups were tentatively assigned to different geometries of the molecule in the junction.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1705.06161/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/1705.06161/full.md

## References

6 references — full list in the complete paper: https://tomesphere.com/paper/1705.06161/full.md

---
Source: https://tomesphere.com/paper/1705.06161