A generalized flow for multi-class and binary classification tasks: An Azure ML approach
Matthew Bihis, Sohini Roychowdhury

TL;DR
This paper introduces a generalized classification flow on Microsoft Azure ML Studio that efficiently handles multi-class and binary datasets, improving accuracy on diverse real-world data sets.
Contribution
The work presents a novel cloud-based classification flow that reduces multi-class problems to binary tasks and enhances accuracy through data dimensionality reduction and hierarchical classification.
Findings
Achieves 78-97.5% accuracy on public datasets
Attains 85.3-95.7% accuracy on a local diabetic retinopathy dataset
Outperforms existing state-of-the-art methods
Abstract
The constant growth in the present day real-world databases pose computational challenges for a single computer. Cloud-based platforms, on the other hand, are capable of handling large volumes of information manipulation tasks, thereby necessitating their use for large real-world data set computations. This work focuses on creating a novel Generalized Flow within the cloud-based computing platform: Microsoft Azure Machine Learning Studio (MAMLS) that accepts multi-class and binary classification data sets alike and processes them to maximize the overall classification accuracy. First, each data set is split into training and testing data sets, respectively. Then, linear and nonlinear classification model parameters are estimated using the training data set. Data dimensionality reduction is then performed to maximize classification accuracy. For multi-class data sets, data centric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
