Graph-Based Bidirectional Transformer Decision Threshold Adjustment Algorithm for Class-Imbalanced Molecular Data
Nicole Hayes, Ekaterina Merkurjev, Guo-Wei Wei

TL;DR
This paper introduces the BTDT-MBO algorithm, combining bidirectional transformers, MBO approaches, and decision threshold adjustments to improve classification accuracy on highly imbalanced molecular datasets, outperforming existing methods.
Contribution
It presents a novel algorithm integrating transformers, MBO, and threshold adjustments specifically designed for class-imbalanced molecular data classification.
Findings
Outperforms existing methods on six molecular datasets.
Effective in high class imbalance scenarios.
Utilizes attention mechanisms and distance correlation for improved accuracy.
Abstract
Data sets with imbalanced class sizes, where one class size is much smaller than that of others, occur exceedingly often in many applications, including those with biological foundations, such as disease diagnosis and drug discovery. Therefore, it is extremely important to be able to identify data elements of classes of various sizes, as a failure to do so can result in heavy costs. Nonetheless, many data classification procedures do not perform well on imbalanced data sets as they often fail to detect elements belonging to underrepresented classes. In this work, we propose the BTDT-MBO algorithm, incorporating Merriman-Bence-Osher (MBO) approaches and a bidirectional transformer, as well as distance correlation and decision threshold adjustments, for data classification tasks on highly imbalanced molecular data sets, where the sizes of the classes vary greatly. The proposed technique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
