Applying a random projection algorithm to optimize machine learning   model for breast lesion classification

Morteza Heidari (1); Sivaramakrishnan Lakshmivarahan (2),; Seyedehnafiseh Mirniaharikandehei (1); Gopichandh Danala (1); Sai Kiran R.; Maryada (2); Hong Liu (1); Bin Zheng (1); ((1) School of Electrical and; Computer Engineering; University of Oklahoma; Norman; OK; USA; (2) School of; Computer Sciences; University of Oklahoma; Norman; OK; USA)

arXiv:2009.09937·cs.CV·February 9, 2021

Applying a random projection algorithm to optimize machine learning model for breast lesion classification

Morteza Heidari (1), Sivaramakrishnan Lakshmivarahan (2),, Seyedehnafiseh Mirniaharikandehei (1), Gopichandh Danala (1), Sai Kiran R., Maryada (2), Hong Liu (1), Bin Zheng (1), ((1) School of Electrical and, Computer Engineering, University of Oklahoma, Norman, OK, USA

PDF

TL;DR

This study demonstrates that applying a random projection algorithm to reduce feature dimensionality significantly improves machine learning model performance in classifying breast lesions from mammograms.

Contribution

The paper introduces the use of a random projection algorithm for feature reduction, enhancing SVM-based classification of breast lesions in medical imaging.

Findings

01

Random projection outperforms PCA, NMF, and Chi-squared methods in classification accuracy.

02

The SVM model with random projection achieved an AUC of 0.84 in lesion classification.

03

Random projection effectively generates optimal feature vectors for medical image analysis.

Abstract

Machine learning is widely used in developing computer-aided diagnosis (CAD) schemes of medical images. However, CAD usually computes large number of image features from the targeted regions, which creates a challenge of how to identify a small and optimal feature vector to build robust machine learning models. In this study, we investigate feasibility of applying a random projection algorithm to build an optimal feature vector from the initially CAD-generated large feature pool and improve performance of machine learning model. We assemble a retrospective dataset involving 1,487 cases of mammograms in which 644 cases have confirmed malignant mass lesions and 843 have benign lesions. A CAD scheme is first applied to segment mass regions and initially compute 181 features. Then, support vector machine (SVM) models embedded with several feature dimensionality reduction methods are built…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSupport Vector Machine