A Machine Learning Based Ensemble Method for Automatic Multiclass   Classification of Decisions

Liming Fu; Peng Liang; Xueying Li; Chen Yang

arXiv:2105.01011·cs.SE·May 5, 2021

A Machine Learning Based Ensemble Method for Automatic Multiclass Classification of Decisions

Liming Fu, Peng Liang, Xueying Li, Chen Yang

PDF

1 Repo

TL;DR

This paper presents an ensemble machine learning approach to automatically classify software development decisions into five types, improving documentation and understanding of decisions during the software lifecycle.

Contribution

It introduces an ensemble classification method optimized for decision type classification, demonstrating its superiority over base classifiers with specific feature selection and extraction techniques.

Findings

01

Ensemble classifiers outperform base classifiers when well constructed.

02

Feature selection significantly improves classification accuracy.

03

Best results achieved with BoW + 50% features, combining NB, LR, and SVM.

Abstract

Stakeholders make various types of decisions with respect to requirements, design, management, and so on during the software development life cycle. Nevertheless, these decisions are typically not well documented and classified due to limited human resources, time, and budget. To this end, automatic approaches provide a promising way. In this paper, we aimed at automatically classifying decisions into five types to help stakeholders better document and understand decisions. First, we collected a dataset from the Hibernate developer mailing list. We then experimented and evaluated 270 configurations regarding feature selection, feature extraction techniques, and machine learning classifiers to seek the best configuration for classifying decisions. Especially, we applied an ensemble learning method and constructed ensemble classifiers to compare the performance between ensemble…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

DoneEI/Automatic-Classification-of-Decisions
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.