MIK: Modified Isolation Kernel for Biological Sequence Visualization, Classification, and Clustering
Sarwan Ali, Prakash Chourasia, Haris Mansoor, Bipin koirala, Murray, Patterson

TL;DR
This paper introduces the Modified Isolation Kernel (MIK), a new density-based similarity measure for t-SNE that improves local and global structure preservation in high-dimensional data visualization.
Contribution
It proposes MIK, an adaptive, robust kernel that enhances t-SNE's ability to capture local structures compared to traditional Gaussian kernels.
Findings
MIK outperforms Gaussian and isolation kernels in structure preservation.
Improved visualization of clusters and subclusters.
Enhanced computational efficiency with MIK.
Abstract
The t-Distributed Stochastic Neighbor Embedding (t-SNE) has emerged as a popular dimensionality reduction technique for visualizing high-dimensional data. It computes pairwise similarities between data points by default using an RBF kernel and random initialization (in low-dimensional space), which successfully captures the overall structure but may struggle to preserve the local structure efficiently. This research proposes a novel approach called the Modified Isolation Kernel (MIK) as an alternative to the Gaussian kernel, which is built upon the concept of the Isolation Kernel. MIK uses adaptive density estimation to capture local structures more accurately and integrates robustness measures. It also assigns higher similarity values to nearby points and lower values to distant points. Comparative research using the normal Gaussian kernel, the isolation kernel, and several…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Genetics, Bioinformatics, and Biomedical Research · Gene expression and cancer classification
MethodsPrincipal Components Analysis · Radial Basis Function
