A Study on Feature Selection Techniques in Educational Data Mining
M. Ramaswami, R. Bhaskaran

TL;DR
This study evaluates various filter feature selection techniques in educational data mining to identify the most relevant features, aiming to improve predictive accuracy and reduce computational costs in student performance models.
Contribution
It compares six filter feature selection algorithms and benchmarks their effectiveness with different classifiers, identifying the best method and optimal feature subset size.
Findings
Best feature selection method identified based on F-measure and ROC values.
Optimal feature subset reduces computational time and cost.
Predictive accuracy improves with fewer features.
Abstract
Educational data mining (EDM) is a new growing research area and the essence of data mining concepts are used in the educational field for the purpose of extracting useful information on the behaviors of students in the learning process. In this EDM, feature selection is to be made for the generation of subset of candidate variables. As the feature selection influences the predictive accuracy of any performance model, it is essential to study elaborately the effectiveness of student performance model in connection with feature selection techniques. In this connection, the present study is devoted not only to investigate the most relevant subset features with minimum cardinality for achieving high predictive performance by adopting various filtered feature selection techniques in data mining but also to evaluate the goodness of subsets with different cardinalities and the quality of six…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Imbalanced Data Classification Techniques · Online Learning and Analytics
