Probabilistically Sampled and Spectrally Clustered Plant Genotypes using Phenotypic Characteristics
Aditya A. Shastri, Kapil Ahuja, Milind B. Ratnaparkhe, and Yann Busnel

TL;DR
This paper introduces a novel spectral clustering method with pivotal sampling for plant genotypes, significantly improving accuracy and reducing computational complexity over traditional hierarchical clustering methods.
Contribution
The paper proposes a new spectral clustering algorithm using pivotal sampling for phenotypic data, demonstrating superior accuracy and efficiency compared to existing methods.
Findings
SC with Pivotal Sampling outperforms HC in accuracy by up to 45%.
The new method reduces computational complexity by more than an order of magnitude.
Experimental results on soybean data validate the effectiveness of the proposed approach.
Abstract
Clustering genotypes based upon their phenotypic characteristics is used to obtain diverse sets of parents that are useful in their breeding programs. The Hierarchical Clustering (HC) algorithm is the current standard in clustering of phenotypic data. This algorithm suffers from low accuracy and high computational complexity issues. To address the accuracy challenge, we propose the use of Spectral Clustering (SC) algorithm. To make the algorithm computationally cheap, we propose using sampling, specifically, Pivotal Sampling that is probability based. Since application of samplings to phenotypic data has not been explored much, for effective comparison, another sampling technique called Vector Quantization (VQ) is adapted for this data as well. VQ has recently given promising results for genome data. The novelty of our SC with Pivotal Sampling algorithm is in constructing the crucial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoybean genetics and cultivation · Genetic Mapping and Diversity in Plants and Animals · Genetics and Plant Breeding
MethodsSpectral Clustering
